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PREFACE 


Progress in the control of inanimate nature has been accom- 
plished in part by the development of more exact methods of de- 
scribing natuial phenomena This contiol of natuie has been 
particularly facilitated by the invention of instiuments of meas- 
urement. Similarly the contiol of human conduct and education 
depends on the development of moic exact methods of debciibing 
human conduct The exact dcsciiption of human conduct can be 
lendered most efficient when it is leduccd to a form of measuie- 
ment, for then small differences ate most accuiately poitrayed 
and small changes most accuiately noted. 

Treatises on psychological measurement have in the past dealt 
almost entiiely with human abilUies Tests and scales have been 
devised for measuiing the gencial level of intellectual ability, 
achievement in school subjects, and aptitude m various special 
abilities As these measuies have been applied to the measuie- 
ment of products of learning m school and to the estimation of 
capacity in the woikaday woild, it has become evident that cer- 
tain important educational outcomes wcic being neglected Tests 
might show how well a pupil is achieving in subject-matter, but 
would not at all show what intciests weic de /eloping, what atti- 
tudes were being foimed, the adequacy of his pcisonal adjust- 
ments, or the development of chaiactci A whole side of human 
nature has been neglected in the development of tests and sca'es 
of ability 

Elated with their success with the measuieinent of intelligence 
during the World War, psychologists turned their attention to the 
measurement of personality and character In a symposium on 
Intelligence and Its Measurement in 1921 many of the contribu- 
tors stated that one of the next steps in reseaich was the develop- 
ment of measurement of character As the late Dr. Colvin put it, 
“The most important ‘next step’ for purposes both of prognosis 
and diagnosis is the formulation of a test that will inform us of 
the character qualities of those tested.” Pintner stated. “I feel 
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that the time is now ripe for active investigation of the emotions, 
tlic character, the will, and so forth, by means of mental test 
methods ” Pressey said “Theie simply must be a courageous 
attack upon the problem of measurement of other than intellectual 
factors It is becoming increasingly obvious that matters of tem- 
perament and character are of very great importance, that they 
operate quite largely independent of intelligence, that prog nosis 
problems cannot be adequately understood without an evaluation 
of these factors” Teiman stated as one of the next steps in re- 
scaicli ‘‘investigations of instinctive, emotional and volitional 
tiaits and of the combinations of these which are involved in pre- 
psychopathic conditions and normal variations m temperament ” 
Thurstone said, “I should like to see another line of mental test 
w'ork opened up, namely the diagnosis of the volitional and emo- 
tional characteristics which determine our character traits ” 

A'lany of the methods and techniques described in this book 
have been patiently and quietly investigated in the psychological 
laboratory for many ycais But within the past decade all have 
been applied to piactical pioblems of human conduct with the 
result that earlier methods have been refined and extended They 
are now being tried experimentally or are in practical use in the 
work of the school, industry, clinic, court, and wherever else there 
are problems of human adjustment This book summarizes the 
research that has been done in these various methods and tech- 
niques Since the winter approaches the problem with a back- 
ground in educational psychology, the viewpoint, illustrations, and 
applications tend tc be largely educational The book is primarily 
addressed to educators wdiose breadth of view causes them to 
see conduct as a primary concern of education But since practical 
inteiest in the techniques described in this book is by no means 
limited to educators, any social scientist, whether in medicine, in- 
dustry, law, or religion, may find herein discussed methods and 
techniques for the more exact study of human relationships. 

Several threads of thought run through the discussion of the 
various methods which tend to give the whole a unified point of 
view The refinement of any technique indicates the need for 
adequate sampling The writer believes that progress in the diag- 
nosis of conduct inevitably leads toward standardization, and he 
has been at pains to point out the possibility and advantages of 
standardization wherever possible. The empirical approach to 
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the study of conduct must pei force be kept uppermost Though 
guided by the best cues and hypotheses available, any investigator 
must plead ignoiance as he delves into the unknown, and must 
depend on results to check thcoiy and dogmatism At every tuin 
pains will be taken to sticss the fallibility of naked human judg- 
ment in diagnosis If theic be anj- one guiding pimciple that rises 
above the othcis, it is the resolve to make use of relationships 
adequately prov'ed by careful experimentation instead of depend- 
ing on limited and unsystematic peisonal expciicnce 
The writer remembers hearing the mcasuicmcnt of non-intellec- 
tual qualities discussed j-eais ago as though ihcie were unlimited 
possibilities in the development of “chai actor tests But he is 
bold enough now to asseit that the main lines whieh such “test- 
ing” must follow are included in the chapter headings of this book. 
Certain fields, to be sure, have been omitted Hypnosis has poten- 
tiality both as a diagnostic and as a theiapeutic technique, but its 
development in this diiection has been veiy ludimentary The 
study of personal letters, diaries, essays, and autobiogiaphies is 
still another souice of evidence, the value of which has yet to be 
determined On the other hand the methods hciein described In- 
volve measurement of the environment as it conditions conduct, 
measurement of the actual conduct itself, and measurement of the 
results of conduct, they involve direct observation and interpre- 
tation of what IS observed, and the testimony of others than the 
observer concerning the findings, and they invoh^e the direct study 
of reactions and verbal testimony conceining reactions That there, 
must be some soit of evidence in the accuiatc description of con- 
duct IS indisputable, and this evidence must proceed from one 
or the other of the above souiccs But there is infinite possibility 
for development and lefinement 

An explanation should be made as to what this book does not 
include as well as to what it does include Tests of ability and in- 
telligence have been carefully av'oided, not because they are with- 
out significance in the diagnosis of conduct, but because they are 
adequately described and discussed elsewhere In several places, 
particularly in the chapters on interviewing, psychoanalysis, and 
case studies, it was difficult to avoid a discussion of the treatment 
and therapeutic value of these techniques However, since diagno- 
sis and treatment can be distinguished, so far as possible discus- 
sion has been confined to diagnosis It would be impossible to do 
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justice to the therapeutic values of these techniques within the 
limits of this volume 

The manuscript of this book was started in 1927. Since that 
time the remarkable Studies m the Nature of Character of the 
Character Education Inquiry have appeared. These studies are 
fundamental to an understanding of the natuie of conduct and 
to the future development of methods of investigation The writer 
wishes to express his debt to the woik of Hartshorne and May, 
uhich IS only too inadequately represented in this book Space 
does not pciinit acknowledgment at this point to the many investi- 
gators whose lescaiches bare thrown light on the problems dis- 
cussed, but reference is given to their work throughout the text. 
Certain contnbutois .-lard out as deserving special mention here 
D S Thomas and Willard Olson for the development of tech- 
niques of observation, Woodworth for the original psychoneurotic 
inventory, Goodwin AVatson, for both his oivn and his students’ 
contiibutions to the measurement and diagnosis of attitudes and 
adjustment, and finally Freud, whose development of psycho- 
analysis has influenced the techniques of diagnosing conduct in 
more ways than can be easily described 

The refeience lists at the end of each chapter contain most of 
the important leferenccs on each topic through 1929 Any one 
planning to do further wmrk on a topic should make a careful 
suivey of w'ork done in the interval after that date. 

The wwiter is under considerable obligation to Dr. E. L. Thorn- 
dike for reading the manuscript and for helpful criticism Dr. D 
G. Paterson and Dr Florence Goodenough have made valuable 
suggestions Above all appreciation is due to the general editor 
of the Century Psychology Series, Dr. R. M. Elliott, for expert 
criticism and advice. 
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Chapter I 

INTRODUCTION: SIGNIFICANCE OF THE DIAGNOSIS 
OF CONDUCT 

T he term diagnosis, as used m this book, has a variety of 
meanings At one extreme it means mere measurement or 
description — the desciiption of a person’s behavior, or of 
a person’s interests, or of his home. When used in this simple 
sense, diagngsis is a synonym for suivey At other times diagnosis 
means discovery — ^the discovery of individuals possessing certain 
characteristics We speak of the discovery of predelinquents, or of 
problem children, and insofar as this discovery involves the ac- 
curate measurement or description of certain characteristics of 
these individuals, it too becomes a fotm of diagnosis 

In a narrower and more specific sense, diagnosis refers to the 
more complete understanding of an individual and of his conduct. 
If a problem child in school has alieady been singled out for 
study, then to proceed with diagnosis is to ascertain further facts 
concerning him, his home enviionment, his companions, his wishes 
and desires, etc , always w ith a view to fathoming the particular 
circumstance or combination of ciicumstances which seems to be 
responsible for his condition In short, the teacher or psychologist 
wants to know why he is a problem child and what can be done 
about It. 

To solve this problem two sets of facts are needed. One must 
know first the facts with regaid to the child and second the sig- 
nificance of such facts as have a bearing on the situation in ques- 
tion, as distinguished from those which are merely the chance 
concomitants For instance, if a child is caught stealing, we want 
to know many general things about both him and his environment 
Then we also want to know the relationship of these factors in the 
child’s life to stealing so that their significance in the particular 
situation may be determined. 


3 



4 Diagnosing Personality and Conduct 

Nature of Measurement of Conduct 

In a discussion of human conduct many persons are confused by 
the reference to measuicmcnt as a piefened foim of diagnosis 
The most obvious evidence that can be gatheied with respect to 
human conduct is purely descriptive Of a pupil in school it may 
be said, for example, that he has a clean face and clean hands^ 
that he is courteous in class and on the playgiound, that he enjoys 
paiticipating in group activities, that he has no brothers and sis- 
ters, oi that he is trustwoithy and mdustiious Much of the clini- 
cal diagnosis resulting in the case study earned on in schools and 
in social work consists of this accuiate description of the occui- 
rcnce of isolated events for which there is no need for measure- 
ment. 

But mcie description of this simple sort does not satisfy all 
of the functions of diagnosis One is not interested merely m 
knowing that John came to school on Tuesday morning and tiied 
to bluff thiough a lecitation in his history class for which he was 
not prepaied One wants to know how frequently this sort of be- 
havior occurs in his vaiious classes, whether there is a similar 
tendency to bluff outside of school, and whether he exhibits other 
symptoms indicating that he is not in proper adjustment to thp 
work of the school. 

Again, merely to know that a man expiesscd his opinion that 
England does not sincerely want naval reduction, but wishes to 
gam an advantage over othei countiies, especially the United 
States, in the disarmament conference, mfoims us veiy little about 
the man We should want to know his reactions to many different 
kinds of social issues before foiming any sort of impression as to 
his fair-mindedness and his tendencies towaid liberalism or con- 
servatism From a single isolated event one is poweiless to predict 
the future In oidcr to pick a satisfactoiy employee, a good wife, 
a pupil showing delinquent tendencies, a pupil showing personality 
disturbances, or a good city mayor, one needs to know certain 
trends in the individual, at least enough to be able to predict 
with some assurance what to expect from the individual in a new 
situation. 

Thoindike has wisely said, “Whatevei exists exists in some 
amount and can be measured” As we pass judgments on people 
m daily life we are constantly making tjiese measuiements m a 
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rough way As we call this man honest, that man prompt, an- 
other well-balanced, another vam, and another introverted, we are 
making vciy ciudc measurements in the form of judgments Hart- 
bhoinc and May found that it was exceedingly lOugh and unsatis- 
factoiy to label people mcicly as honest or dishonest One wants 
to know whether a man is honest m this, that, or the other situa- 
tion, and under vaiious degiees of temptation on several diffeient 
occasions Measurement merely attempts to make these descrip- 
tions finer and more exact. Instead of the two classifications honest 
and dishonest, the scale might run from neve) honest, through 
seldom, sometimes, and usually, to always honest Or the findings 
might be “honest in fifteen out of t« enty situations in which there 
was opportunity to steal, cheat, or he ” In pioportion as the grada- 
tions become finer, one leaves qualitative description and adopts 
a form of measurement The peifecting of measuiement requires 
that attention be paid to the equality and evenness of units and 
to the zero point of the scale 

A key-note of this book, a point of emphasis to crop out again 
and again in the discussion of various techniques of investiga- 
tion, is the emphasis on adequacy of sampling A single observa- 
tion IS unreliable, a single rating is unreliable, a single test is un- 
leliable, a single measurement is unreliable, a single answer to a 
question is unieliable Reliability is achieved by heaping up ob- 
servations, ratings, tests, questions, measures Ask a boy to-day 
what he wants to do when he glows up, and he may lespond, “Be 
an aviator,” when a few ycais ago his choice would have been 
“Locomotive engineer ” Because a boy’s choices tend to be fickle 
and to change even week by tveek, we have him indicate his liking 
or disliking for each of a hundred or so occupations, activities, 
studies, and people Then the outlines of certain broad, fundamen- 
tal interests emerge which arc stable and impoitant If you ask one 
teacher for her judgment of a boy’s trustivorthiness, you obtain 
what she has been able to obseive in those few naiiow class-room 
situations that appeared when her attention was particularly di- 
rected to some act involving honesty. An adequate rating, on the 
other hand, requires the judgment of several raters in several 
situations at several different times. Reliable evidence must be 
multiplied evidence. 

Two main streams of investigation have developed in the field 
of conduct diagnosis. One may be called the experimental ap- 
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proach. Investigators using experimental techniques have in great 
part been interested in questions as to the natuie of behavior and 
character, and of the foiccs affecting the development of person- 
ality In geneial these mvcstigatois have been tiained m labora- 
tory techniques or in psychological testing methods and have 
employed statistical methods m the validation of the diagnostic 
techniques The experimentalist’s insistence on isolating the effect 
of a single variable, on eliminating or at least controlling ex- 
traneous factois, and on the value of large samplings has played 
an important part in the development of these diagnostic tech- 
niques. 

The other line of approach to the pioblem of diagnosing conduct 
may be called the cluneal. Since the primary purpose of investi- 
gators using clinical methods has been to render service, they have 
been more concerned with using the techniques in guidance, in 
curing the sick, in placing people in industry, and in keeping them 
out of conflict with society, than in studying techniques analyti- 
cally and ascertaining relationships between phenomena. These 
investigators have developed techniques that could be immediately 
used m a practical way in the work of the clinic, employment of- 
fice, school, or court Rating methods, the standardized interview, 
and psychoanalysis are typical pioducts of this group of workeis, 
and while these do not equal the questionnaire or testing methods 
in respect to accuracy, they have a demonstrable practical 
value. 

These two groups of investigators have not always understood 
each other, and therefore have often w^oiked at cross purposes 
Clinical workeis, desiring devices that could be used immediately, 
have failed to test these devices thoroughly before using them 
and have also lacked caution when mterpieting results Because 
their emphasis is concentrated upon interpreting a particular pres- 
ent situation, they tend to substitute for the painstaking analysis 
of statistical inquiry judgments and infciences based on clinical 
experience. Indeed, the clinical worker distrusts the conclusions 
drawn from statistics, they seem to have a sort of unreality for 
him, not inviting the confidence he places on things observed with 
his own eyes, felt with his own hands, or heard with his own ears. 
Then, too, he harbors the belief that statistics cover up differences, 
that since every individual presents a unique problem, is in fact a 
new bundle of forces existing in a combination that never existed 
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together before, statistical methods are too clumsy to deal with 
individuals To him nothing can be better than the naked human 
judgment trained by adequate clinical experience. 

In answer to this the expeiimentalist points out the fallacy of 
drawing conclusions from a single case or even fiom a small num- 
ber of cases The clinical worker may notice that a criminal has a 
peculiarly shaped skull and then conclude that all criminals have 
some sort of brain deformity The clinical worker may notice that 
some pupils in school who have reading difficulties have more or 
less severe cases of “congenital word blindness” and build on this 
observation a theory of reading disability. Some gifted children 
having been observed to be weak and puny, there arises the popu- 
lar generalization that bright childien are frail and sickly These 
observations become either trivial or false when large numbers of 
children are accurately observed under comparable conditions 

For such reasons the experimentalist is extremely skeptical of 
the value of the case study method for arriving at general truths 
based on observation and interviews. To him the experience gained 
from first-hand contact with actual cases has only a small chance 
of leading to valid generalizations unless, as is unlikely, this ex- 
perience has been based on a large number of cases under care- 
fully controlled conditions where accurate observation or measure- 
ment IS possible. 

A retort from the clinical woiker may charge the expeiimental- 
ist with merely talking theor}'- and belittle his tests and measures 
because they do not begm to furnish all of the evidence needed m 
treating a practical case and aie too clumsy or time-consuming 
even when they are available 

In conclusion it must be conceded that valuable diagnostic 
techniques have been developed by each school To the clinical 
worker we are indebted for techniques which describe the present 
situation and for promising suggestions and challenging hypotheses 
regarding the significance of these facts To the experimentalist we 
are indebted for the exhaustive, laborious checking-up of these 
hypotheses and the development of tested and standardized diag- 
nostic devices for measuring them To the experimentalist we 
must ultimately look for the accelerated progress in diagnosis 
which is to give us increased control over human affairs But in 
the practical affairs of the world to-day clinical workers are doing 
valuable service with makeshift methods of their own devising. 
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Areas Where the Diagnosis of Conduct Is Needed 

The interests of the two groups are fundamentally the same, 
for both aie interested m the development of devices which will 
help solve ceitain of the moie pressing problems m the control of 
human relations These problems may be listed under the head- 
ings of crime, insanity, vocation, and citizenship. 

Diagnosis of Crime. If society is ever to solve the problem of 
crime, it must have methods of studying the criminal The 
diagnosis of ciime and of the ciiminal may have several mean- 
ings. 

I. Diagnosis of incipient tendencies toward crime. Although 
scientific pioof is lacking, there is stiong presumptive evidence 
that most crime is not a spontaneous, accidental occurrence — a 
“sport” in human behavior — but is usually pieceded by peculiar 
behavior chaiacteiistics, which are its diiect antecedents extend- 
ing as far back as early childhood One may, therefore, speak of 
tests of predelinquency, tests that lecognize delinquent or ciiminal 
behavior while in the incipient stages before it assumes serious or 
tragic piopoitions. 

2 Prognosis of ciime. Tests piognostic of crime would not dif- 
fer markedly fiom the foimer group Though it is possible that 
there may be tests presumptive of a latei ciiminal career which 
would have little or no relation to present behavior characteristics, 
the existence of such tests is dubious Probably the best piedic- 
tion of later ciiminal behavior can be had from a diagnosis of 
present behavior. 

3. Diagnosis of the fact of crime This is what is ordinarily 
meant by the diagnosis of ciime Society usually pays no attention 
to crime or the criminal until a crime has been committed, where- 
upon the main question becomes “Did this individual, on whom 
suspicion rests, commit the crime 

4 What commitment shall be made of the individual convicted 
of a crime requires for its answer a certain type of diagnosis. The 
law has one answer to give, medical and social science another 
answer. 

5. Why did this individual commit the enme^ A study of the 
factots — environmental and social — antecedent to and causative of 
the crime should help society determine more rationally what is to 
be Its treatment of the criminal offender. 



Significance of the Diagnosis of Conduct 9 

6 Prognosis of improvement What are the chances that a crim- 
inal can be cuied’ If the theory is believed that crime is a sudden 
outbuist of a fleeting impulse, then it is dilTicult to make any sort 
of piognosis, if one believes that the criminal is biologically a 
degenerated human being, then the prognosis of impiovement is 
hopeless, but if it is believed that most crime is the result of 
learning determined by environmental influence, then one can 
estimate the chances that these same habits can be changed, 'or 
that these tensions and maladjustments can be corrected 

Diagnosis of Insanity. The pioblcm of the diagnosis of crime 
and that of the diagnosis of insanity are not enliicly separate, 
because many tendencies which cause a person to become an 
enemy to society aie due to deficiencies in emotional stability or 
adequacy of adjustment, 01 definite psychoses or neuioses which 
arc also recognized as symptoms of insanity We may list seveial 
meanings of the diagnosis of insanity as has been done for the 
diagnosis of crime 

I In the first place, most psychiatrists recognize that insanity 
is not a mushroom growth spiouting spontaneously under cir- 
cumstances of stress and strain Neurotic or psychotic trends, emo- 
tional maladjustments and similar inadequacies of behavior 
usually have their roots far back m childhood Where this is the 
case, these tiends aic recognizable in childhood or adolescence 
by one who is familiar with the symptoms Where these disorders 
are functional, they should be discovered as early as possible m 
the child’s life in order that some soit of remedial treatment or 
reeducation can be offcied The discovery of problem children in 
school and the diagnosis of then ills aie very important. 

2. Similar to the above is the prognosis of mental balance and 
stability Probably the same tests, questionnaires, 01 othei tech- 
niques which describe the adequacy of childhood adjustments will 
also be the best obtainable prognosis of future adjustment 

3. Tests are badly needed as mcasuies of the degree or serious- 
ness of mental unbalance to be used as criteria for commitment to 
mental hospitals. At the present time commitment is made on the 
certificate of a. physician with no supporting evidence or tests 
except the common-sense investigation of the case by the physi- 
cian and his j'udgment based on his observations. Although m the 
long run the interests both of society and of the individual are 
fairly safeguarded, .no one will dispute that such a hit-or-miss 



lo Diagnosing Personality and Conduct 

method, depending more or less on unstandardized evidence, 
means tliat there are nide differences in the decisions Many are 
committed as insane nho have greater mental stability than 
others ^ho are unmolested members of society. More definite 
criteria of insanity and more definite standards by which to judge 
Its degree are needed. 

4. We should have not only tests to reveal the fact of insanity 
but techniques to distinguish the kind of insanity. Little progress 
of definite scientific value has been made in this direction Insanity 
is still diagnosed largely by the chnical methods of noting symp- 
toms and making an assignment to one or another of the tradi- 
tional categories of insanity As a result of their examinations, two 
experienced psychiatrists may make entirely different interpreta- 
tions of tlie nature of a mental abnormality Before definite prog- 
ress can be expected, the present classification of mental disease 
<nust be superseded by a classification which is more in harmony 
with accepted psychological theories To-day all is hypothesis. 
Ever)' worker in the field has his own classification, and a classi- 
fication of mental disorders so fundamental as to command the 
respect of all has yet to appear. On the diagnosis of the type of 
insanity, however, depends the decision as to the type of com- 
mitment to be made and the treatment to be given. 

5. Prognosis After a commitment is made, there is need of a 
still further diagnosis of the chances of recovery. Here again the 
methods available are hopelessly inadequate Methods of dealing 
with insanity' are highly tentative, and consequently little more 
than a chance prediction of the course of the condition can be 
given. 

6. Intensive research into the causes of insanity and into the 
factors responsible for the increase in mental breakdowns is 
needed. The adequacy of such investigations will depend on the 
availability of accurate measures of the individual and his en- 
vironment, and means for determining quantitatively relation- 
ships between the various factors 

Diagnosis of Vocational Competence. In studies of vocational 
competence most attention has been paid to measures of ability. 
It IS only natural that the first consideration in hiring a worker 
should be his ability to do the task at hand. But it has been dis- 
covered that conduct factors are also important in estimating a 
worker's value. He must not only be able to do the job, he must 
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get along with people and also have certain character quali- 
ties of regularity, thoroughness, willingness to stick to the job, 
and the like. Of fundamental importance is the worker’s interest 
in a given type of work For instance, he might have the requisite 
skills and abilities to be a “tree surgeon,” but unless he has a 
definite interest in being out of doors, and in growing things as 
opposed to working with machines, with people, with books and 
papers, he would be unhappy and unsuccessful in the work Vo- 
cational counselors are aw’are that they need measures of interests 
and of tendencies toward lesponsibihty and regularity to supple- 
ment measures of ability, which have already proved their use- 
fulness. 

Diagnosis of Citizenship. A fourth area w'here the diagnosis of 
conduct IS needed is in the w'ork of the schools toward the de- 
velopment of good citizenship The more tests of intelligence and 
achievement are constructed, and the better they become, thf 
more painful grows the feeling that certain important educational 
outcomes are being neglected m the measurement program 
Schools are not solely interested in academic achievement and 
success in subject-matter Certain character qualities are also 
important outcomes or “concomitants”* of school woik. The 
development of measures of achievement has tended to over- 
emphasize the academic and scholarship side of schools, to the 
neglect of the wider social educational values. 

More specifically, education should lesult not only in the forma- 
tion of habits, skills, and informations but also m attitudes of 
acceptance or rejection In histoiy, for instance, pupils should 
not only learn certain facts regarding historical events and per- 
sonages, but they should form certain attitudes toward vital social 
issues. Tests of historical information have long been available, 
but measures of social attitude are a more recent development. 

The extra-curricular program is one part of the educational 
scheme for which there has been practically no means of evalua- 
tion. With growing recognition of the social values of education 
and the importance of the social life of the school in the develop- 
ment of habits of good citizenship and favorable personal ad- 
justments, there has come a need for the evaluation of extra- 
curricular activities. If the extra-curricular program is looking 
toward the development of qualities of leadership, then methods 

* Kilpatrick, W. H, Foundations of Method (The Macmillan Company, 1925 )• 
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of detecting the development of leadership arc needed. If the 
extra-cun iculai program aims at the development of qualities of 
cooperation, good spoitsmanship, and social responsibility, then 
means for noting individual differences in these qualities are 
needed There is in this country a National Honor Society for 
high school students, membership in which is determined by 
standing in scholarship, leadership, character, and service. Ways 
of measuring these qualities m high school students are much 
needed if membeiship in the society is to have any sort of objec- 
tive basis. 

Types of Evidence 

The following forms of evidence are described in this book: 

Observation 
Rating methods 
Questionnaire 
to measure adjustment 
to measure attitude 
to measure interest 
Tests 

paper and pencil tests of knowledge and judgment 
performance tests 
The free association method 
Physiological measures 
Interviewing 
Psychoanalysis 
External signs 

Pleasures of the environment 

Before these are described in detail in the succeeding chapters, 
there are certain general relationships existing between these 
different techniques, discussion of which may help in evaluating 
their significance. 

Measures of the Environment, Reactions, or Results 

These various types of evidence can be distinguished as de- 
scriptions of the environment, descriptions of reactions, and de- 
scriptions of the results of conduct Measures of the environment 
are important inasmuch as conduct is conditioned by the en- 
vironment. A rich, stimulating, socially adequate environment 
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leads to one kind of conduct, a poverty-stiicken, squalid, harsh, 
or ciuel environment leads to anolhei kind of conduct So im- 
poitant is the environment that it can be used to help evaluate or 
e\cn pi edict conduct Delinquency flounslies in one kind of en- 
vironment, good citizenship in another Pupils coming from good 
homes with thoughtful parental supcitision, attending good 
schools with an atmosphere of social coopeiation, and having 
access to adequate chuiches and plajgiounds aie almost certain 
to exhibit dcsiiable chaiacteristics. 

The most obvious way to diagnose conduct is to desciibe reac- 
tions diiectly, which becomes the second tvpe of diagnosis. The 
veiy conduct itself may be on the one hand directly described 
either by observation and tests, or thiough judgment by rating 
methods, or on the o' her hand it may be obtained indirectly, either 
thiough interviews with the inditidual conceined or tvith those 
acquainted with him, or thiough answers to questionnaiics. 

Descriptions of the results of conduct have not pioten so fruit- 
ful as diagnostic techniques Conduct is supposed to leave its 
imprint in various physical chaiacteiistics such as facial cxpiession, 
handwriting, postuie, and other external signs, but careful ex- 
perimentation has not revealed these relationships ^ Many have 
thought also that conduct could be diagnosed by studying the pro- 
ducts of activity, as for example the lesiilts of certain tests, but 
these methods again have yielded little of significance. Conduct 
IS best studied by catching it in the process. 

Direct vs. Indirect Attack. 

The frontal attack on the description of conduct is often not 
possible. The obseivei 01 cxaminci is always a factor in the situa- 
tion, and since conduct is so sensitive to its sui roundings, it be- 
comes altered by the piescnce of the cxammei Because peisons are 
always trying to make a good impicssion, just as soon as the ex- 
perimenter announces that he is trying to find out how people 
behave the behavior itself changes One cannot announce to a 
group that he is conducting a test of cheating, for then no matter 
what else is done, the temptation to cheat is inhibited. 

There are times when the direct frontal attack is satisfactory. In 
cases where there is nothing to hide or conceal, the conduct ob- 

• Paterson, D G , Physique and Intellect (The Century Co , 1930) 
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served may be quite naive and natural. Persons who enter an 
Interview in order to benefit themselves have every incentive to 
describe the actual state of affaiis On the other hand, m the 
involuntaiy interview one must provisionally accept testimony 
at its face value, only to compare it for coiioboration with other 
evidence as opportunity occurs In free association malingering Is 
possible, but with a reasonable amount of cooperation on the part 
of the subject, the conditions of the experiment almost completely 
determine the response What seems to be a perfectly free choice 
on the part of the subject is really determined by the conditions 
of his mind In the questionnaiie, however, where the pressure of 
time is removed, theie is a possibility that answers may be given 
for the effect they produce on the examiner If the questionnaire 
permits the individual answering it to show himself to advantage, 
then there will be every incentive to tell the truth But if truthful 
answers reveal things \\ Inch may be condemned or criticized, then 
the temptation to falsehood becomes very stiong 

In such cases the disguised test or questionnaire becomes im- 
perative Disguise IS quite possible in the case of the question- 
naire, Inasmuch as apparently harmless or trivial questions often 
have the power of disclosing significant tendencies when taken 
with other similar questions If the questionnaire is headed “In- 
terest Quc'tionnaire” or “Personal Data Sheet” or some other In- 
nocuous title. It may soothe any suspicion that It is measuring 
fundamental habit systems, adjustments, or characteristics of the 
individual. 


Voluntary vs. Involuntary Diagnosis 

When a patient goes to his physician for advice and treatment 
he IS in a very different frame of mind from the person who has 
to fill out his income tax blank or testify in court The persons 
who voluntarily seek out an interview with some counselor to 
obtain guidance in a personal problem are already free from an 
initial resistance to complete frankness. To be sure, there is an 
unavoidable human tendency to picture things in as rosy a light 
as possible, but however much the testimony departs from the 
facts. It IS an unintentional misrepresentation. Those who under- 
take to be psychoanalyzed have no alternative other than that of 
revealing the truth. On the other hand, if a person Is summoned 
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to an interview against his will, eitlier before he is ready to admit 
his fault or because he is the cause of some othei poison’s distress, 
then there is an initial lesisiance and defense that must be broken 
down The dean who calls m a student in disciplinary difficulty 
must first persuade the student that he is friendly, that he is fair, 
and that he can be of some benefit to the student before the 
student will reveal himself candidly There must exist a certain 
amount of mutual understanding and cooperation between an 
investigator and his group of subjects before they will answer a 
psychological questionnaire truthfully. 

Standardization vs. Versatility 

Another issue in the diagnosis of conduct concerns the matter of 
standardization. The practical social worker or counselor is some- 
what impatient with attempts to standardize diagnostic techniques. 
To be sure, he will use the Binet Intelligence Test, welcoming the 
accuracy and impersonality which its standardized technique 
affords, but in probing into the recesses of individual conduct or 
into complexes, inhibitions, and obsessions, he may become some- 
what impatient with tendencies toward standardization, and skep- 
tical of their value In the interview there is a tendency to allow 
the freest sort of intercourse between subject and intei viewer. 
While the subject is urged to cxpicss himself openly, to air all his 
grievances, to make complaints without restraint, the interviewer 
demands the right to follow up clues wherever they lead He claims 
that a prepared schedule of questions can never anticipate the 
detail that will be necessary to complete the personal story at vital 
points. The interview'd', he thinks, must discovei the plot as he 
goes along. No prepared schedule can possibly be so adaptable as 
to fit all the exigencies of highly individual situations 

The weakness in this point of view lies in its overconfidence. 
Psychology has taught us much concerning the fallibility of hu- 
man observation, perception, inference, and judgment. The inter- 
viewer overlooks his disposition to neglect certain points of the 
evidence presented in favor of other points; his preliminary bias 
toward the problem in the shape of pet theories, to support which 
he is constantly on the watch; and his tendency to stop short of 
a complete survey of the situation The scientist is aware that 
these human weaknesses can contaminate even the description of 
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mr-nimate phenomena and recognizes the victory that has been 
won in the use of exact methods of measuiement and experi- 
mentation I'lic tendency towaid standaidization of the diagnosis 
of conduct IS an attempt to take the fallibility out of human 
judgment, the partiality from observation and the tendency to 
generalize on inadequate evidence out of conclusions. The psycho- 
logical qucstionnaiic or case study schedule helps the mvestigatoi 
base his results on the answeis to a laigc number of questions 
rather than a limited number They help him survey all corners 
of a field lathei than become satisfied with uncoveiing the obvious 
and convenient facts and then slopping shoit The questionnaire, 
paiiiculaily, enables one to make use of proved relationships be- 
tween answeis to a question and inferences from these answers, 
as against the unproved “hunches” that every worker gathers 
in the course of his expeiience Finally the questionnaire, instead 
of depending for accuracy on the bieadth of experience of the 
investigator, makes accurate and objective comparisons possible 
belucen the answers of one indiviJiud and those of other indi- 
viduals or gioups. 

The Question o: Interpretation 

In the last section the matter of interpietation was mentioned 
as one of the main difficulties that has to be contended with in 
the diagnosis of conduct This has not received the full considera- 
tion which It deseivcs The practical worker in the field tends 
to discount altogether any suggestion that his interpretation of 
his observations and inquiries is open to ciiticism Perhaps a cock- 
sure attitude IS necessary foi any one who is going to be influ- 
ential in practical affairs wheie the situation is usually pressing, 
and where the tendency is to give one’s best judgment and ad- 
vice and then back it up with the force of positive authority. 
Psychological expeiimcnts, however, indicate that grave and seri- 
ous errors lurk around every judgment and inference. These will 
be treated more at length in subsequent chapters Suffice it to say 
that previous experience and our habitual modes of thought tend 
to warp every judgment made more or less seriously. Genuine 
progress can be made only on the foundation of patient investiga- 
tion into lelationships, and practical diagnosis can be made ac- 
curate only in piopoition as the investigator is willing to be thor- 
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ough and exhaustive and to defei his private judgment to the 
tested results of experimental inquiry 

In the diagnosis of the insane, of delinquents, and of problem 
children in courts and mental clinics it is customaiy to employ 
both psychiatrists and psychologists The former have authority 
and prestige and are better paid, the activities of the latter are 
narrowly restricted to giving mental tests and recoi cling observa- 
tions of behavior and characteiistics of the patient during the test. 
The techniques employed by the psychologist aie thoroughly 
standardized, and his lepoits arc highly objective On the other 
hand the psychiatrist, in whose hands the disposition of the case 
finally rests, uses the most subjective of methods, even where 
ceilain neuiological and medical tests aic employed The psychi- 
atric examination propei is nothing moie than a loosely conducted 
inteiview whose diiection is determined as it piocceds In con- 
trast to the psychological examination, in which standardized 
methods are used, the inquiiy into the patient’s wishes and desires, 
his thwai tings, modes of compensation, moods, obsessions, and 
the like, is highly subjective The conclusions icachcd by the 
psychiatiist depend foi then value upon his fairness, thorough- 
ness, and experience 

Verbal vs. Non-verbal Diagnosis 

Before language has developed, the only way in nhich conduct 
can be studied is by observing it diiectly This method of ob- 
servation retains its usefulness as a fundamental technique for 
the study of all ages But obseivalion is time-consuming and la- 
borious, and therefore as language develops in the child, it be- 
comes a temptation to ask the child about himself Later, m the 
study of adults, it is natural to use a person’s testimony concern- 
ing his own behavior — indeed, it would be unnatural not to do so. 
This substitution of testimony for direct observation carries with 
it certain implications which should be clearly understood To 
ask a child about his interests, his likes and dislikes, his habits 
and skills. Is to ask him about things that he peihaps has never 
observed The greater part of a person’s conduct originates with- 
out the aid of his verbal 01 ganization, and for this reason a special 
act of learning is necessary befoic a pci son steps out to observe 
himself objectively. To the adult who has grown accustomed to 
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considering his own pioblems it seems only natural that the 
child who has the tool of language at his command can use it with 
personal refeience However, this assumption needs to be tested. 

Along with the question of the possibility of self-obseivation 
goes the matter of the voluntary alteration of language to fit pur- 
poses. It is relatively difficult to change conduct itself in the pres- 
ence of the examiner, for though one can act hypocritically for a 
short time, sooner or later the true response will reveal itself. 
But language is ever so much more versatile than conduct There 
need be only the loosest connection between language and con- 
duct, between uhat a peison says he will do and what he actually 
does, between what he says he believes or likes and what he 
actually believes or likes as shown by his conduct 

Notwithstanding this thread-like relationship, the results of 
questionnaiies and interviews have demonstiated their useful- 
ness But the empiiical appioach wdiich is referred to again and 
again throughout the pages of this book must be emphasized 
here Answers to a questionnane must be correlated with other 
known chaiactenstics of individuals, and the answers must then 
be interpietcd in the light of these correlations Answers must not 
necessarily be taken at face value, foi their implications are often 
subtle and obscure The findings of statistical inquiry often re- 
veal significances to questions and answci s not dreamed of by the 
investigator Verbal evidence is to be valued highly in the diag- 
nosis of conduct It is of all foims of evidence the easiest to ob- 
tain, to record, and to study objectively But its significance must 
be tested and discovered. 

Honesty 

At several points in the previous discussion the question of 
honesty arose In the indirect appioach to conduct diagnosis, the 
question of honesty does not entei, but where direct questions are 
asked a subj'ect and the diagnosis depends on the truthfulness 
with which he answers, honesty looms large as an issue Evidence 
is slowly accumulating concerning the honesty with which chil- 
dren testify concerning themselves It is by no means too per 
cent. Sometimes the error may be ascribed to ignorance, some- 
times to purposeful deception based on a definite desire to con- 
ceal unpleasant truths Honest replies depend in part on the 
voluntary nature of the testimony, and in part on the degree to 
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which the subject believes his testimony will be appiovcd In- 
stead of condemning questionnaiics in a wholesale fashion be- 
cause they may yield dishonest evidence, we should assume the 
attitude that here is a distuibing element which must be studied 
and Its influence discovered under varying conditions so that it 
may be controlled or allowed for. 

Standards of Conduct 

When one commences to investigate conduct he enters the field 
of morals, and as soon as an issue becomes a moral one, the 
question arises. By what standards do you judge conduct^ 

Insofar as possible in this book, the matter of standards Is 
avoided The mam emphasis in diagnosis should be upon descrip- 
tion rather than pidgment, upon the endeavor to describe conduct 
fully and accurately without trying to evaluate it or pass j'udg- 
ment upon it In practical affairs, however, standards of j'udgment 
and evaluation are necessary. Society sets up rules of conduct to 
guide its affairs, and these become embedded unconsciously in its 
mores, or embodied consciously in its laws. Eveiy institution — ^the 
school and home particularly — sets up standards of conduct to 
guide Its members In this book these standards are accepted as 
they have been formulated by students or lawgivers. If they are 
standards of conformity, then there aie methods of rating and 
observation for studying these facts' If they are standards of 
adjustment or integration or efficiency or self-expression, then 
there are various techniques for discovering these facts In meas- 
urement the scale defines the standard, and the piocedure then 
becomes one of merely stating the facts in teims of more or less 
on the scale This book is concerned less with what to measure 
than with how to measure 

Occasionally, when one must make some sort of evaluation of 
the Items of a test or questionnaire before it can be used to meas- 
ure, experts or qualified persons are called upon to decide as to 
the direction or tendency of various answers to questions If one 
is studying knowledge of health conduct, then experts must de- 
cide what answers represent acceptable health standards. If one 
is using questions to measure the liberalism-conservatism of a 
group, then the questions must be evaluated by competent persons 
who decide which answers lean in the liberal direction and which 



20 Diagnosing Personality and Conduct 

in the consetvative There are also cmpiucal methods of deter- 
mining the diicction tendency of answcis to questions by tabulat- 
ing the lesponses of diftcicnt gioups alieady socially selected ac- 
coiding to the standaids which one is measuiing For instance, the 
different ways in which delinquent and non-delinquent groups an- 
swer various questions enable one to determine the significance 
of these questions as mcasuics of delinquent tendencies. 

In the case of ptiiely moial issues wheie standards are deter- 
mined by the expeit judgment of some group, it should be kept in 
mind that moial standards arc purely relative, and that no group 
can claim aiithoiity in the pionounccment of moral standards 
Since Hartshoine and May, in oidei to constiuct a scoring key, 
refeired their tests of knowledge of right and wrong to a group of 
educators interested in chaiacter education, the tests represent 
nothing more than the standaids held by this one section of the 
community. 

Standards of conduct are most trustwoithy when based on ob- 
jective evidence that they satisfy certain accepted values. For 
instance, if health is an accepted value, then scientific investiga- 
tions which yield evidence of the healthful qualities of activities 
certify to the acceptability of these activities Likewise if effi- 
ciency IS an accepted value, efficient behavior can be found by 
experimentation To discuss the question of standards adequately 
would requiie a volume in itself In this book methods of diag- 
nosing conduct aie discussed The othci issue of validating this 
conduct must be left for a subsequent treatment 

Reliability of Diagnosis 

A primary concern of this book is the accuracy or reliability 
of diagnosis or measuiemeni In every chapter pains have been 
taken to comb carefully the technical literature in order to dis- 
cover and present the evidence bearing on reliability. 

The significance of the coefficients of correlation which show the 
agreement between repetitions of the measures of the same phe- 
nomena is discussed in books on statistical methods There are 
two types of evidence on reliability One is the correlation of one 
half of the measuring instrument against the other half (the halves 
chosen at random) giving a measure of the consistency of the in- 
strument, The other is the correlation of the measurement with 
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a repetition of the same measurement on the same individuals at a 
latei date This involves not only the consistency of the instru- 
ment but also and pnmaiily the consistency of perfoimance of 
those being measuied In measures of ability the latter is the more 
important because we want to know not only how consistent from 
part to part the instiument is but how reliable a single applica- 
tion of the test is foi mcasiiiing an individual under all times and 
circumstances In the diagnosis of conduct, however, one does not 
expect neaily as much consistency of individual peiformance as 
he expects in the mcasuicmcnt of ability Where abilities change 
only very slowly, conduct may shift rapidly with the changing 
situation Hollingwoith found that neurotic Woild Wai soldiers, 
tested immediately aftei the armistice, made responses to a ques- 
tionnaire quite diffeient fiom those they had made before. 

But one is concerned pumarily with the consistency of the in- 
struments thcmschcs, foi he is nevei quite sine of the real exist- 
ence of many of the categoiics by which human natuie is described 
until these have been tested Foi instance, it is claimed that an- 
swers to certain questions aie symptomatic of introversion or ex- 
troversion If a list of these questions is assembled and given to 
a group of people, this assumption can be tested If the questions 
are sorted into two gioups in a chance mannei, and individuals 
who answei the questions in one gioup with a decided tendency 
toward introversion, anstvei the questions in the olhei gioup with 
no tendency at all, much doubt is thrown on the ability of the 
questionnaire to show the intioveision-extioveision difference If 
on the other hand, pci sons showing stiong intioveision tendencies 
on one half of the questions also show these same tendencies on 
the other half, the instrument opeiates with high consistency or 
reliability The cocflicient of corielalion between the scoies on 
the two halves mcasuies this tendency toward reliability. The 
reliability of a diagnostic technique is one of the most nnpoitant 
facts we can know about it If it is unreliable, it is not measuring 
anything accurately and is useless in its present form However, if 
the technique has promise of being significant, the unreliability 
can perhaps be remedied by extending the sampling or standard- 
izing the proceduie 

Do we “traitify” human experience by our methods of measure- 
ment? Do we assume fiom a method of measurement whereby 
we ask a number of questions and then proceed to total the an- 
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swers so as to yield a total score that there is something m human 
nature that corresponds to this score ^ It must be admitted that 
this is oftentimes the case It is easy to fall into the trap of first 
positing a trait, such as persistence or dependability, analyzing 
the trait into a number of specific situations in which it might 
manifest itself, framing these in question form, and finally com- 
puting a score on the basis of answers which indicate the presence 
of the trait. This method makes the unwarranted assumption first 
of all that the trait actually does exist 
One must be constantly on his guard in measurement not to 
create falsely by the very names which he gives to his tests a 
fictitious relevance or significance which they do not actually have. 
Psychology has suffered grievously in the past fiom an exuberance 
of imagination on the pait of its builders which has led them to 
create concepts with no corresponding reality as proved by their 
demolition by later experimentation Hypostatizahon is the name 
the logicians give to this fallacy The whole faculty psychology 
was erected by assigning a reality, concreteness, and independence 
to certain concepts of mental life that subsequent investigation 
shows do not have a corresponding independent existence. Like- 
wise the presem tra‘it psychology' is under the very strongest suspi- 
cion of being similarly unfounded This should put every investi- 
gator on his guard against ci eating psychological fiction by the 
very names which he assigns to his tests 
The check for detecting this fallacy is measurement of the 
reliability of the tests, questionnaires, or other instruments used 
If the instrument has high reliability, this is evidence that there is 
something there in reality, even if the name assigned to the test 
may not properly describe it One may speak of a questionnaire 
to measure introversion-extroversion when in reality it may merely 
measure the subject’s tendency to think of himself as introverted 
or extroverted. What the instrument really measures must be 
determined by the correlations of the instrument with other fac- 
tors. But the reality of the confact * must be determined by the 
high reliability of the instrument. 

•Symonds, P M, The Nature of Conduct (The Macmillan Company, 1928). 



Chapter II 
OBSERVATION 

A S a technique of investigation, observation is of prime im- 
portance. All experience, all data, must enter through the 
portal of the sense organs There is nothing that can be 
known or studied that does not have fiist to come under observa-i 
tion This truism is often ovetlooked because sometimes we make 
our data so accessible, so easy to read and interpret, that the 
pioblems of observation reduce almost to the vanishing point 
For instance, when we reduce the measuiement of weight first to 
the simple manipulation of scales so that they come to a balance 
and then to the reading off of a figure maiking the balancing 
point on the measuiing scale, errois due to observation are negli- 
gible On the other hand, if we have to judge weight by lifting 
another peison in our aims, eiiois of obseivation are consider- 
able Again, if we reduce a pei son’s ability to a senes of written 
responses in the form of ciosscs or undcilmings to certain stand- 
ard exercises, and we scoie these icsponscs by comparison with a 
previously prepaied key, the only eiiors of obseivation are those 
which result from eiiors in checking the lesponses against the key 
or in counting the coiicct responses But if we have to depend for 
our estimate of a pci son’s ability on scattered impressions based 
on casual observation of his behavior in whatever situation we 
can find him, our judgment mil be extremely liable to eiioi. 

Advance in scientific method has consisted in large part In de- 
vising instruments and techniques that help reduce the criors of 
observation. Sometimes phenomena aie magnified so that they 
can be more easily grasped by the senses, permitting smaller dif- 
ferences to be observed, as with the telescope and microscope 
Sometimes differences are brought closer together in space and 
time so that comparisons and discriminations can be more ac- 
curately made. The most efficient method of eliminating errors 
of observation is to reduce the phenomena to a permanent form 
to be studied at one[^s leisure and convenience, so that errors due 
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to observation “on the run” and to faulty memory may be 
at oided 

Many of the techniques desciibcd in this book are fairly objec- 
tive, so that errois of obscivation play only a minor part in 
them But the piogrcss of the science of measurement of conduct 
has not pioceedcd far enough to provide techniques for the objec- 
tive measurement of most of the phenomena of conduct We must 
still rely laigely on diicct obseivation of conduct itself, employing 
all of the cautions and safeguards and improvements that have 
grown up m the technique of obseivation 

In .studi m g the behavior of infams, mvestigators_have had to 
depend piincipally on obseivation At this early age articulate 
lesponaes aie not available, and hence cannot be used as a source 
of etidence Consequently most of the advance in the technique of 
obscitation hails fiom the nuisery schools and child development 
institutes Valuable contributions have been made in this field by 
Olson (8)‘ and D S Thomas (ii) But the value of dirert ob- 
seivation IS not limited to the eaily ages, and indeed th^echniques 
to be des'ciibed might well be borrowed~and applied to t he study 
of all a ges Direct obseivation is a ftindamenfaT source of data 
which IS at the same time a welcome complement to, and a check 
upon, the sometimes too heavy reliance on tests and question- 
naires. 

In this chapter we shall first discuss the fundamental condi- 
tions of obsciving and peiceiving and analyze the errors that fre- 
quently occur Next the question of selecting and defining what 
is to be observed will be discussed This will lead to a treatment 
of sampling, units of measurement, and reliability in observation. 
Finally, certain practical questions conceining observation and 
methods of keeping records will be considered. 

Essentials to Observation 

The first prerequisite to accurate observation is the possession 
^of efficient sense organs. In practice it turns out that most observa- 
tion is visual The eyes are said to be the most sensitive of all 
of our sense organs, and a good pair of eyes with lenses that 
focus, both at near and far distances, and without astigmatism. 

Numbers in parentheses refer in each case to references at the end of the 
chapter. 
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IS an mipoitant adjunct foi successful observation Glasses may 
collect certain defects, while at the same time adding difficulties 
Color-blindness is a defect of the eye which renders complete if 
not accurate observation impossible Of course not all sense data 
may come by sight, and hence acute and sensitive sense organs 
for hearing, taste, smell, touch, pressure, and temperatuie, not 
to mention the kmsesthetic senses, are necessary for complete 
observation. 

A second essential to good obseivation may be summed up 
under the term alertness To make a coriect observation one must 
attend to the object being studied To diiect attention to the right 
place IS not the simple and obrious matter that it may seem In 
obedience to the laws of attention, competing stimuli often at- 
tract the attention to the wrong place or at the wrong time 
Psychologists tell us that we attend to changing stimuli, that, 
other things being equal, the stiength of the stimulus is a prepo- 
tent factor; that repetition of the stimulus dulls the attention, 
and that striking quality or definite form attracts Working against 
these, competing inteiests or competing stimuli distiact 

The fact that the attention is diawn powerfully and com- 
pelhngly to certain stimuli rather than others is made use of 
by every magician In stage tricks the magician usually manages 
to divert the attention by his own movements and the diiection 
of his own gaze while an occupied hand is shifting the caids or 
putting the rabbit m the hat Munsleibeig (6, p 29) demon- 
strated this experimentally on the lecture platfoim of his class- 
room as follows. 

“I stood on the platfoim behind a low desk and begged the 
men to watch and to desciibc e\ciything which I was going to 
do from one given signal to anothei As soon as the signal was 
given, I lifted with my right hand a little revolving wheel with 
a colour-disk and made it run and change its color, and all the 
time, while I kept the little instiumcnt at the height of my head, 
I turned my eyes eagerly tow'ard it While this was going on, up 
to the closing signal, I took with my left hand, at first, a pencil 
from my vest-pocket and wrote something at the desk, then I 
took my watch out and laid it on the table, then I took a silver 
cigarette-box from my pocket, opened it, took a cigarette out of 
It, closed It with a loud click, and returned it to my pocket; and 
then came the ending signal The icsulls shoived that eighteen of 
the hundred had not noticed anything of all that I was doing 
With my left hand. Pencil and watch and cigarettes had simply 
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not existed for them The meie fact that I myself seemed to give 
all my attention to the colour-wheel had evidently inhibited in 
them the impicssions of the other side Yet I had made my move- 
ments of the left arm so ostentatiously, and I had beforehand so 
earnestly insisted that they ought to watch every single move- 
ment, that I hardly expected to make any one overlook the larger 
part of my actions.” 

In the hurly-burly of a complex situation it takes considerable 
skill to direct the attention properly to the point of critical issue. 
Many are unable to enjo)'^ a football game because they do not 
know how to observ'e “Beautiful mterfeience,” “a hole between 
center and left guard,” “an incompleted pass” are mysterious 
teims because the attention is m the wrong place So in a class- 
room one may not direct his attention properly to discover how 
pupils react to the complex social situation In describing her 
experimental work with nursery children Miss Thomas (ii, p. 6) 
notes, 

“The ordmaiy observational technique was shown to be totally 
invalid as an instrument for recording behavior for research pur- 
poses A check-up was made by having several observers make 
diary records of the social behavior of given children. There was 
a marked tendency for a given recorder to note one aspect in 
the record, another in the next record Although trained to make 
objective records in the sense of including overt reactions only, 
the result was necessaiily subjective in that the recorder, usually 
unconsciously, selected specific parts of the total behavior act 
This selection became, furthermore, invariably inconsistent over a 
period of time The obvious solution to this difficulty was to break 
up the behavior-complex into relatively simple units which would 
enable a record to be made of every recurrence of one of these be- 
havior units ” 

third essential to good observation in some situations is the 
ability to make reasonably accurate estimates without the use of 
special instruments. When they are available, measuring instru- 
ments help us to dispense with some of the skill necessary for 
discrimination This ability to make accurate estimates is some- 
thing that may be improved by training Indeed, in much of their 
casual experience young children are increasing their ability to 
make accurate estimates by observation They must learn to 
estimate distances and to gage then movements accordingly, learn 
how far to reach out a hand, or how long a step to take. But these 



Observation 


27 


estimates, accurate enough for the practical business of living, 
especially since an error is easily coriected by a new adjustment 
of reaching a little farther oi taking another step, are woefully 
inaccurate when we must make a record of our impressions, espe- 
cially initial impressions Similarly, judgment or estimates of 
number, height, weight, volume, intervals of time, brightness, 
color, pitch, volume of sound, tempeiature, taste, smell, and the 
like may be and usually are extremely inaccurate Such estimates 
based on observation are shown clearly to be in error when com- 
parisons are made between them and objective records. 

The ability to make accurate observation is something that 
can be increased by practice up to the point, indeed, wheie some 
persons by long expeiicncc and diiccted practice have gained 
remarkable skill in accurate observation. Winch showed haw 
the ability of school-childicn to make random observations 
can be increased measuiably Every one, by devoting atten- 
tion to the matter and piactising, can become highly accurate 
in ability to estimate distances, lengths, weights, volumes. 


e^ 

1 / k fourth essential in observation, the capacity to make fine dis- 
tinctions, grows directly out of the previous point Accuracy in 
estimating leads to accuracy in making comparisons One who is 
able to estimate accuiately the length of a line is also thereby 
able to judge more accurately which of two lines is the longer. 
The mother is alert to small changes in hei son’s behavior; she 
recognizes even slight irritability as a possible symptom of ill- 
ness or fatigue The intei viewer learns to detect slight indications 
of emotion or nervousness in an individual and uses these signs 
to help direct his questions to bung out significant items of in- 
formation Just as some persons at a concert appreciate every 
shade of interpretation of the artist or cringe at technical errors, 
while others applaud the good and the bad equally, so certain of 
us remain obtuse to the social forces at play in a situation, while 
o^ers are sensitive to every nuance. 

1/ A fifth essential to accurate observation is freedom from vari- 
ous pathological states A fatigued person cannot observe ac- 
curately; his attention lapses, and things pass by unheeded Alco- 
hol and drugs have well-known effects on the power to observe 
They lessen accuracy and impair the balance that is necessary for 
correct weighting of the various factors involved in correct inter- 
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pretations Fceble-mindedness or Insanity renders a person incom- 
petent to make accurate obseivation 

Still another essential to obseivation, the sixth in our list, lies 
Tn making an immediate and accmate record It is well known 
that memoiy fades The clear-cut peiception which one may have 
at the moment dims with the passage of time, and the memory 
image becomes indistinct Testimony in court is often inaccurate, 
not because the original obseivation was faulty, but because in 
the interval between the event and the recall the memory of the 
event has grown dim Any one who wishes to preserve the results 
of accuiate observation should plan to make an immediate record 
of them 

A seventh essential to good observation lies in the ability to 
peiceive accuiatcly, for when that stage of observation which is 
called peiception is reached, the opportunity for error increases 
ticmendously In peiception we make an interpretation as to 
the significance of what our sensory data includes What we see 
or heal is taken as a cue of the piesence of a whole object or 
situation of which the stimulus is only a pait This tendency to 
generalize on the basis of a limited expeiience is utilized by the 
radio, which employs such devices as the slapping of two sticks 
toegther to indicate a revolver report, or the roll of a drum to 
indicate distant thundei These intei pretations of sensory stimuli 
are laigely a matter of habit 

Now the observation of human conduct is full of such inter- 
pretations which are at the same time necessary and liable to 
erior and unreliability The doctor diagnoses a disease on the 
basis of a few symptoms, the psychiatrist diagnoses a neurosis or 
a family situation on the basis of a few questions and observations 
But the opportunities foi illusion aie numeious The child buried 
in a book may not be studying, but day-d learning, the awkward, 
clumsy girl may have been growing so lapidly as to outstrip her 
powers of muscular coordination, or she may be trying to attract 
attention. We tend to credit the child with pleasant expression or 
a sparkling eye, or the child who wears glasses, with intelligence. 
We fail to look for the brightest child in a class-room among the 
youngest and smallest The boy who gams our applause by some 
cheap trick of foolery is really covering over some other weakness 
or deficiency from which he wishes to distract our attention. 
Heinlein (4) found that observation of children’s rhythm in 
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marching was much distiirbcrl by the rhythm in the music which 
accompanied the maiching To obseive correctly is one thing — 
to mteiprct what we see is a different matter. 

AIiss Thomas (ii, p 9) notes inaccuracy in perception in her 
experimental work as follows: 

“One reason for oui low icliability on the mimbei of social con- 
tacts (irrespective of timing, the Peaison r’s ranged from only 47 
to 80 for the six obscivcrs in the preliminaiy reliability study) 
IS our neglect to diffeientiate between the merely spatial and func- 
tional social relationships. Hence, when a given child appi cached 
anothei, the recordei made his otvn interpretation of whether it 
was a genuine social contact These low' coefficients of con elation 
aie a beautiful example of how' unicliability cieeps in wheie inter- 
pretation IS permitted the rccoider” 

To state as an eighth essential of good observation fieedom from 
'piejvdice 01 from liabils of uiteipfetation is perhaps merely to 
reinforce the value of correct perception But to place stress on 
the habitual natuie of our perceptions is important We grow up 
to see things from the point of view of the use we make of them, 
our relations to them, or we make analogies in interpreting new 
phenomena in terms of our lesponse to the familiar The hen 
responds to the shiny oval piece of porcelain in her nest exactly 
as to a real egg, and lays another beside it Hunters use decoys 
to attract their game or feign the cries of the animals which they 
wish to shoot It IS most difficult foi most persons to describe 
without interpieting, yet this is the essential of good observa- 
tion The teacher brands a pupil as lazy, but unjustly What she 
should do is to obscive and describe his behavior as well as its 
antecedents and then survey the possible intcipretations — illness, 
bad habits, competing interests, inability to cope w'lth the task, 
etc — before deciding on one Every supcrvisoi w'ho observes a 
class-room lesson enters the situation full of prejudicial habits of 
thinking. He may be a supei visor of the old school w'ho is hoi- 
rified by the least sign of disorder, or a supervisor of the new 
school who is shocked by regular rows of desks and a formal ques- 
tion-and-answer recitation So completely is observation warped 
by interpretation that two visitors to the same class may make 
quite conflicting reports If supervisors could be taught to observe 
teachers, or teachers taught to observe children, accurately and 
precisely, and then make their mterpreta Lions, judgment of con- 
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duct would be considerably improved The social scientist who 
would use observation as a method of investigation must approach 
the situation to be observed free from indoctrination, ready to 
describe what is there without bias 

Clarke (2) in The Ait of Straight Thinking points out what we 
shall list as a ninth essential to good observation freedom from 
excitement When the emotions are aroused, all conduct becomes 
impetuous, impulsive The tendency to interpret instead of observe 
becomes reinforced Mere observation with postponement of in- 
terpretation becomes impossible under excitement We demand 
action, and action requires interpretation of the situation The 
moral of this is to refuse to make observations when a crisis 
arises Do not wait until a child commits a misdemeanor before 
starting to diagnose the situation It is hard for a person under 
suspicion to escape being misinterpreted Every misstep, every 
slip, every aberration is seen as additional corroboratory evidence 
Observation should be made systematically when there is no crisis, 
and Its results should be recorded to be used when the crisis 
arises if necessary. 

Selecting and Defining What Is to Be Observed 

Observation may be divided into two types — finding observa- 
tion and directed obseivatioii In the foimer one knows simply 
that he is to observe a given situation, not what he is to look for. 
It may be that he is to try to discover the factors inherent in 
the given situation, as when the writer (10) once conducted a 
senes of observations on the study activities of high school boys 
He admitted before he began that he did not know what study 
was, or how it was constituted, but he could state that his pur- 
pose was to make a survey of the activities constituting study 
by observing boys while they studied To conduct such a finding 
observation the observer must be generally competent in the field 
and must know the possibilities Every finding observation is 
restricted by the limitations of the observer’s background of ex- 
perience. For example, in attempting to discover the factors enter- 
ing into high school study, I might have neglected various clerical 
skills of copying, using the alphabetical index, and the like, if I 
had not been sensitive to these factors from my general experience 
with various sorts of reactions. ■' , ‘i 
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Directed ohiervatmv on the otbei hand is quite definitely limited 
to the schedule used A list of acts which has been pieviously 
prepared provides the basis of the observation Attention is con- 
centrated on the occurrence of the items in the list, while every- 
thing outside of this list is ignored 

The finding observation must precede the directed observation, 
because only on the basis of a finding observation can the inven- 
tory or check list to guide the directed observation be constructed. 
Andrus, (i) in her pieliminary woik in inventorying the habits 
of young childien, had her observers keep diary records which 
were merely running notes of every act of the child under observa- 
tion Entries m these diaiies weie then analyzed into separate 
habit lists, and from these the inventory list was constructed 
Enough diaries were used to ensure that the check-list would not 
be appreciably increased by tlie use of additional diaiy lecoids 
This check list then became a guide to be used in subsequent 
directed observations 

Other more analytical methods of selecting the check list to be 
used in directed observation have been employed Olson, for in- 
stance, derived an inventory of tics and nervous habits in children 
by a tabulation of their mention in the literature on nervous and 
mental diseases This method, of course, depends on the observa- 
tion of other persons who are recognized experts in the field After 
Olson (8) had completed the first part of his investigation, he 
determined the correlation of each of the five groups of nervous 
habits — oral, nasal, hirsutal, oculai, and auial — with the total 
number of nervous habits, and found that the oral habits gave 
the highest correlation, -j- 77 This was taken to indicate that 
oral nervous habits are the most symptomatic group, and so In 
later work he concentrated his attention on this particular group. 
This empirical method of selecting items to be observed has much 
to commend it 

Miss Thomas states that she selects her items for observation 
both on the basis of their social importance (validity) and also 
on the reliability with which they can be observed and the degree 
to which they can be observed with freedom from prejudice The 
latter is mainly a matter of trial, but also depends partly on the 
way in which the Items are defined and limited. For determining 
the importance of the items she has recourse to finding observa- 
tions: ' , . . - ' ; 
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“Not only must we emphasize how we obtain data, but what 
we obtain must of itself be important. It may be questioned as to 
how we know we are getting at what I have called ‘important’ 
aspects of social behavior. We say ‘important’ advisedly — for some 
selection on an a prion basis is necessary in statistical investiga- 
tions We consider con elation coefficients and other instruments 
depending on the theory of probability as poor means of discovery 
without a preceding intimate knowledge of the way in which the 
data behave, but excellent instruments for controlling bias and 
evaluating relationships if we first know our materials.” (ii, 
P- 19) 

Olson, however, made a perfectly justifiable use of correlation 
in selecting important items 

An adequate description of the items to be observed is very 
important, foi the precision with which the items are described 
determines the amount of interpretation which the observer needs 
to make Miss Loomis (ii, ch III), working under Miss Thomas, 
found that there was little difficulty in recording the number of 
physical contacts one child made with another child, but that there 
was difficulty in classifying them into such categories as “hit, 
point, pull, push, caress, explanation, accident, or assistance ” Of 
two observers making simultaneous records on the same situation, 
one recorded, “Conflict between William and Edward,” whereas 
the other observer recorded it “William embraces Edward. . ” 

(ii, p ii) To reduce the subjectivity of interpretation is the 
purpose of detailed descriptions of the items to be observed 

Observation as Measurement 

Once the items for observation have been selected and defined, 
one must decide upon the unit of measurement Newcomb (7) 
in his study of problem boys carried a pack of cards around in his 
pocket and simply jotted down incidents as he noticed them as 
they occurred at one time or another during a summer camp 
period. The unit of measurement here was the occurrence or non- 
occurrence of a given act over a penod of time. Others who have 
more control over the situation have tried counting the number 
of occurrences of an act in an interval of time. But as Olson found 
in connection with his nervous habits, sometimes a child will con- 
tinue to keep his thumb in his mouth continuously while another 
child will keep putting his thumb into his mouth and taking it out. 
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In such a case a count of the “number of times” a child puts his 
thumb in his mouth is of little value 
An advance over these methods of measuring in observation is 
to break the peiiod of observation up into short intervals and 
note the occurrence or non-occunence of the act or habit during 
each interval These intervals may be mere segments of a con- 
tinuous peiiod, or they may be intervals coming on separate days. 
The latter method will give the better sampling. Olson tried ob- 
servation periods of live and ten minutes, one a day for from ten 
to twenty days. His technique of using as the unit of measurement 
the occurrence or non-occurience of an item within a short time 
interval is a distinct contribution to the method of observation as 
a tool for scientific experimentation. 

Miss Parten, working undet Miss Goodenough (3), extended 
Olson’s technique by obtaining forty observations over one-minute 
intervals The empirical results were also almost entirely in agree- 
ment with the theoretical piedictions from twenty obseivations 
by the Spearman-Brown formula, indicating that almost any de- 
siied reliability can be obtained by incieasing the number of ob- 
servational samples. Later ten-second intervals were used, a check 
being made at the end of each interval of the presence of the 
act or behavior under observation Seemingly there is no limit 
to the increase in reliability to be obtained by (a) decreasing the 
size of the observation interval, (b) increasing the number of 
observations, and (c) properly spacing the observations to take 
care of the sampling of varying concomitants of the situation such 
as group activity, time of day, fatigue, persons present, etc 
Exactly how these three variables in the measuring process- 
of observation will be weighted must be decided empirically Sev- 
eral factors are concerned, and some traits may require longer 
time Intervals than others Simple situations, such as movement 
vs passivity, laughing vs not laughing, reading vs not reading, 
need only short time intervals Other more complex traits, such 
as cheating, borrowing, teasing, and the like, need longer time 
intervals. The frequency of occurrence of the behavior is another 
factor that must be considered in deciding upon the time interval 
“Making the assignment” as an item in teacher observation comes 
only once a class period, and very short observation intervals are 
out of the question. The number of observations is also determined 
by the objectivity of the trait to be rated. Traits in which there 
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is a subjective element need more observation in order to achieve 
the desired icliability. Constancy of manifestation of the habit is 
still another factor 

Miss Thomas raises a pertinent issue in this connection If one 
IS interested m studying growth or change or development, the 
observations must not be stiung out over too long a period of 
time. Indeed, the aim m this case should be to get as instantaneous 
a picture or cross-section as possible Some soit of compromise 
must be made between extending the observations for the purpose 
of obtaining reliability and compressing them m order to get a 
more nearly instantaneous picture This issue will be particularly 
important where observation is used to record the result of learn- 
ing experiments conducted over short periods of time. One solu- 
tion is to increase the number of simultaneous observations by 
different observers. 

Miss Thomas points out that there are two questions involved 
in the reliability of observation, one of which is the consistency 
of a given expciimenter’s obser\ation over a period. This is partly 
a matter of the skill of the observer, but is largely determined by 
the sampling and range of items to be observed Observation in 
a narrow situation may yield more leliable results than observa- 
tion when the situation varies. Again, in the variable situation 
greater reliability can be achieved by extending the observation. 

The other type of reliability concerns the agreement of two 
observers. Here the matter of objectivity enters, two observers 
will agree more closely if the behavioi to be observed is objectively 
defined and if interpretation is not called for Skill of the observer, 
sampling of situations, and number of observation periods all enter 
into the reliability of the observations of two judges Of the two 
types of reliability, the agreement of two judges is more important 
than the consistency of a single judge because the former includes 
the latter. 

Olson finds the reliability in observing oral nervous habits in 
children to be .76 for ten ten-minute observations and .87 for 
twenty five-minute observations (240 cases of school children from 
age eight to age twelve). Similar figures for 169 children in depart- 
mental classes in school were .69 (ten ten-minute observations) 
and 82 (twenty five-minute observations) Quite obviously the 
advantage in reliability goes to the larger number of observations 
and the shorter observation interval. - » 
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Olson (8, p. 24) also shows that the constancy of oral habits 
decreases with increasing intervals between observation periods. 


Table i 


Constancy of Oral Nervous Habits over an Interval 
(from Olson) 


Grade 

No of 
subjects 

6A. . . 

... 25 

SB 

31 

SA 

31 

2B-2A 

29 

2B-2A 

32 

1A-2B 

26 

sA-sA 

40 


Interval 
in days 

Method • 

r 

r corrected 
for 

attenuation * 

8 

1 

80 

98 

40 

2 

SO 

S7 

35 

2 

SI 

59 

49 

I 

49 

.60 

158 

3 

26 

32 

ISO 

3 

49 

59 

365 

2 

4.0 

46 


• Method I Ten ten-minute observations vs twenty five-minute observations 
Method 2 Twenty five-muiute observations vs twenty five-minute observa- 
tions 

Method 3 ^urteen ten-minute observations vs twenty fiv'e-minute observa- 
tions 

•* In correcting for attenuation the reliability of the ten ten-minute observa- 
tions was taken as 76, for the fourteen ten-minute observations as 80, and for 
the twenty five-minute observations as 87 

Although certain inconsistencies in the technique tend to make 
the correlations over long periods too low, the evidence shows that 
with accurate observation there is high consistency of habits 
over short periods, but that this drops off over longer peiiods 
Even after a year, however, theie is considerable constancy in the 
number of oral nervous habits that a child retains. 

Olson also gives reliability coefficients showing that some nerv- 
ous habits aie observed less reliably than others, partly due to 
the difficulties in observation 

Miss Thomas reports several sets of reliability coefficients for 
observing different functions In a study by Miss Barker in which 
the time spent by children m different activities was observed, 
the reliabilities ran from 92 to 96 This, however, does not in- 
clude errors of the type in which one observer fails to check an 
activity noted by the other observer, a type which was present 
in a considerable proportion of the cases. When observers are 
trained to time all activities, the reliability rises to 9S-.98 Similar 
reliabilities ( 97 and 98) were found in the observation of two 
observers concerning the amount of space covered by children in 
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their spontaneous activity. Lower reliabilities (.47 to 80) were 
found for “number of social contacts,” but this is due to lack of 
agreement as to how social contact shall be defined The reliability 
for observations of two observers on percentage of time spent in 
social situations by each child was 69 and 86, for the number 
of social situations recorded foi each child, .85 and 86, and for 
the number of children played with by each child, 77 and .88. 

Miss Goodenough reports the following reliability coefficients 
on observations by two independent observers, whose combined 
observations did not occupy, on the average, more than twenty 
minutes for each child, and which were not made simultaneously. 
It is inferred that the time interval taken as the unit of measure- 
ment IS ten seconds. 

Table 2 

Reliability Coefficients for Observation 
(from Goodenough 3 p 234) 


Physical activity 

58 

Leadership 

74 

Gencial activity 

80 

Anger 

. 71 

Laughter 

32 

Dramatic play 

87 

Com crsation 

72 

Self-help 

• 35 

Social participation 

60 

Response to food .. . 

• SO 

Reluctance 


71 



These reliabilities are not high, but show what can be achieved 
with untrained observers and under somewhat unfavorable cir- 
cumstances. 

Conditions of Observation 

The necessity that the observer be m a favorable position to 
observe is so obvious a factor that it needs little discussion Olson, 
for instance, took a position at the front and side of the room 
where he could observe the faces of all the children in the class. 
Much that passes for observation, however, is obtained from only 
fleeting glimpses, with obstructions to the view, and in face of 
distractions of various kinds 

, One critical issue m the technique of observation is the amount 
of control of the situation which should be exercised. On the one 
hand, we may make observation in a strictly laboratory situation 
with various factors such as light, temperature, and material and 
social stimuli rigidly controlled On the other hand, we may wish 
to exert no control whatever over the situation. In the laboratory 
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type of situation the numbei of variables is reduced so that the 
naked relationship between two variables is moie clearly appre- 
hended. On the other hand, there aie those who maintain that the 
most valuable information comes from judging how a child reacts 
in a normal social environment as it happens to develop. The ques- 
tion at issue here goes back to the fundamental nature of social 
behavior. Some maintain that the factors operating are so com- 
plex that it is not possible to take Humpty-Dumpty apart in the 
laboratory and study the pieces, and then properly know how 
these pieces will work when put together again. That the whole is 
not equaled by the sum of its parts is the belief of many students 
of social reactions. 

Certainly greater control could be obtained if it were possible 
to study the independent influence of separate factors, from which 
a certain amount of generalization would be possible If we are 
reduced to describing the behavior of total situations, the number 
of different situations to be investigated becomes discouragingly 
large. But it must be admitted that the problem is not so simple 
as in the physical sciences, where the influence of each factor is 
observable. In the social sciences the factors are more numerous, 
seemingly defying control and measurement, and hence the imme- 
diate advantage of analysis is not visible 

A kindred issue concerns the part the observer should play. 
On the one hand theie are those who would remove the examiner 
entirely from the situation. If he is present, it is agieed he should 
be a passive and unobtrusive visitor, but it would be better still 
if he could be invisible The Yale Psycho-Clinic has devised a 
screen behind which the observer becomes invisible to children 
in the room, and is therefore eliminated as a stimulating factor 
in the situation. The writer became skilful in obscivmg the study 
habits of boys through a technique of concentrating on a certain 
boy until the boy looked at him, whereupon he would turn his 
attention elsewhere. It is possible to disarm suspicion by appar- 
ently being unaware of what is going on. At the other extreme is 
the observer who actually take s an act ive part in th^situation by 
asking questions, making suggestions, tea^ng, and the like. Ob- 
servation may be an important feature of an interview. The part 
that an observer takes in the situation depends on the particular 
problem under investigation. 

Quite apart from its bearing on the matter of reliability, there 
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are issues connected with the length of the observation period. 
Sometimes, as in the study ot sleep or the influence of drugs, the 
observation may be earned over extended periods Again, if it is a 
case of merely studying the frequency of ceitain simple behavior 
items, the period of observation may be very short To observe 
a teacher for fifteen minutes in order to formulate a judgment 
as to that teacher’s efficiency is ridiculous A complex process such 
as teaching cannot possibly be adequately sampled in so short a 
period On the other hand, Olson found 100 minutes to be ade- 
quate for the reliable observation of nervous habits of a class-room 
of childien 

\^’hen one is studying growth or development, the observation 
may be earned on for months or yeais, although of course not 
continuously Certain wnteis have kept diaries and others accurate 
rccoids of their own childien’s development based on observation 
earned on over long periods of time On the other hand, an in- 
stantaneous cross-section picture may be what is desired. 

Whether to concentrate the observation on one child or on 
several childien is another issue Naturally it would be more eco- 
nomical if more than one child could be observed at a time. In 
general one may say that m a finding observation the attention 
should be devoted to one child or to a very small group If, on 
the other hand, the observation is diiccted by a very short sched- 
ule of Items which are easily observable, more than one pupil 
can be observed at a time The writer m his investigation of study 
habits followed one boy around the school for a whole morning. 
His purpose in concentrating on a single boy was to make an 
intensive study of everything that one boy would do relating to 
study procedures. If, on the other hand, certain items of study 
behavior were hsted for observation, it might be possible to cover 
the activities of a whole class-room or study hall at once by the 
use of seating charts and symbols to represent different activities. 

Recording Observation 

Finally the importance of record-keeping in observation must be 
emphasized The development of knowledge and the scientific 
method depends on the accumulation of records. Observation is 
resorted to only because the phenomena to be studied are fleeting 
and transitory and do not leave their own records. We cannot 
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depend on memory, because impressions are nearly as fleeting as 
the phenomena themselves Items of observation not only become 
lost, but are mutilated or altered when the memory alone is 
charged with their preservation. Observations should be recorded 
on the spot, since impressions lose their vividness even after a 
few hours. 

In finding observation the records must be in the nature of 
running commentaries or diaries. While investigating study habits 
the writer kept at hand a small note-book in which eveiy act of 
the boy under observation was stealthily entered In observing 
small children the recoid need not be kept out of sight, as its 
meaning is not understood and consequently it does not become 
a disturbing factor in the situation These diary records should 
be revised for permanent filing because in their oiiginal state they 
are apt to be fragmentary, sketchy, and full of ciyptic symbols. 

In directed observation a check list is used, and the technique 
of recording is correspondingly radically altered In the fiist place, 
the observer must be thoroughly familiar with the inventory or 
check list There is usually no time to run over the list in search 
of a particular item. Its place in the list must be known ahead of 
time. Usually it is possible to prepare special sheets so that the 
occurrence of an item can be checked A favorite device available 
in class-room observation is the seating plan, in which each child’s 
name is inserted Armed with such a seating plan the observer 
can record through a set of symbols the events of the class-room 
period, the identity of those asking questions, the identity of those 
answering questions, and the nature of the responses. Miss Barker 
(ii, ch II), in studying the movements of nursery school chil- 
dren, made use of a floor plan of the space on the roof of the 
building where the children engaged in free activity By the use 
of lines to denote a child’s movement and position, and symbols 
to denote the nature of the activity, certain aspects of behavior 
were recorded so as to be susceptible to accurate measurement. 
The use of these diagrammatic records of observation, be it noted, 
requires a certain amount of skill and practice. 
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Chapter III 
RATING METHODS 


Purposes of Rating 

R ating methods, which are among the recognized means 
of measuring conduct, are used both for purposes of in- 
vestigation and in practical ways to evaluate personnel 
in industry and schools. Introduced at first timidly and with 
reservation, rating methods are now employed with growing con- 
fidence in large industrial, mercantile, and financial establishments 
Reports have been made of the results of rating methods to assist 
in recommending promotions, dctei mining wage increases, and de- 
ciding tiansfers to positions of greater authority In education, 
ratings are used widely in school systems to evaluate personnel. 
Teachers’ agencies quite generally lesort to them in an endeavor 
to obtain infoimation conccining then clients Rating methods 
help a large organization keep the same contact with its person- 
nel that may be obtained through peisonal observation in a 
small group. Systematic rating has been credited with the fol- 
lowing advantages. 


Advantages of Rating 

Rating is an aid in administration. By means of a permanent 
and objective record, an administration is enabled to evaluate the 
worth of Its personnel A school superintendent or business man- 
ager, through the use of ratings, may be enabled to make wise 
decisions regarding employment, placement, transfers, promotions, 
and dismissals. Although ratings are imperfect, as will be subse- 
quently shown, nevertheless by the use of various devices an 
administrator is able to make less subjective decisions than cus- 
tomarily result when the only evidence is that of casual observa- 
tion, gossip, or the influence of personal favorites. Rating methods 
are not merely another device of the efficiency expert. They are 

41 
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a means of getting important information that often can be 
gained in no othci nay 

Ratings stimulate the person being rated. Ratings stimulate the 
person being rated It is often lecommendcd that persons be shown 
the results of their rating If this is done in a dignified, impersonal 
way, ahd if the person rated is made to feel that the ratings have 
been honestly given, and paiticulaily that they represent the 
pooled judgment of several of his supeiiors, the result should be 
salutary. An employee is thereby made aware of his deficiencies 
and shortcomings and has every incentive to improve himself in 
those particulars wherein he falls short Furthermore, if the em- 
ployee has previousl)' rated himself on the same scale and then 
compares his ovin estimate of himself with the estimate made by 
otheis, he can often be made to see cleaily in what inspects he has 
misjudged himself in comparison with the impression he makes 
on others. A sympathetic supervisor can often make ratings an 
effective tool in his delicate task of impioving the efficiency and 
morale of personnel. 

It should be added, however, that in certain successful person- 
nel work in industiial and meicantile establishments rating meth- 
ods have been found to be relatively mflcvible and rigid in com- 
parison with interview methods, especially when the latter aie 
conducted from the psj'chiatiic point of \icw ''' 

Ratings react in a favorable way on the person doing the rat- 
ing. Ratings react in a favorable way on the person doing the 
rating In drawing up the rating schedule he is made to think out 
closely just uhat qualities arc dcsiiable in his employees He 
becomes more observant both of the desirable qualities and of 
the differences among his staff m respect to these qualities He is 
made more sensitive to undesirable points in a man, but he is 
also quicker to recognize and rewaid piaiseworthy traits Be- 
cause his attention is directed to the characteristics of his workers 
he becomes more sensitive in appreciating them, and the resulting 
in increased interest has its rebound in a better morale among 
the force. 

Ratings, if periodically given, help keep alive the personnel 
spirit. They make a staff feel that they are being held responsible 
for good service There is nothing quite so destiuctive as neglect. 
An unoccupied house apparently detciiorates much faster than a 
•See Anderson, V, V., Psychtatry tn Industry (Harper St Brothers, 1929). 



Rating Methods 43 

house wherein people live Failure to pay attention to personnel 
IS sure to lead to caielessiicss, slovenliness, and loss of efficiency. 
In a large organization vheie the peisonal touch of the manage- 
ment can only extend a little way, ratings repeated peiiodically 
and systematically help to keep the personnel spirit alive This 
IS the more true if the force is given e\ idence that the ratings are 
not merely filed away, but are studied and used as a basis for 
decisions concerning the individuals rated. 

Ratings help make judgment anal5rtical. Without rating meth- 
ods one IS inclined to gne an opinion concerning a person which 
is no more than a general impression A rating schedule usually 
asks for opinions on scieral different traits or qualities If thought- 
fully selected, these separate qualities may be made to represent 
important sides of a man's personality In using such a scale the 
rater is forced to consider one quality or trait at a time to the 
exclusion of others — he is forced to be analytical. At the end of 
the rating he has not expressed a single blanket opinion concern- 
ing a man, but has analyzed his opinion into many phases. No 
man is wholly or evenly good or bad in all respects, and the 
carefully planned rating schedule yields a judgment that indicates 
points of relative strength or weakness A rating scale makes it 
possible to give analytical judgment 

Ratmgs systematically given tend to make judgment repre- 
sentative. If an opinion 01 a judgment of a man is given in an 
emergency or when the man is under special consideration, the 
rating is especially likely to be colored by the rater’s likes and 
dislikes and his own intciest m the situation. If an opinion is 
given after a man has made himself prominent by some word 
or act, the rating is suie to be waipcd It will be a near-sighted 
estimate Ratings should be made on no special occasion and 
under no unusual pressuie, but regularly and systematically as 
a routine measure. 

Finally, ratings are a recognized method of gettmg data for 
research purposes. In psychological research rating has already 
had wide use and has performed valuable service Cleeton and 
Knight (21), for instance, resorted to ratings to demonstrate that 
character could not be read by external signs In the field of 
social psychology and in the study of the personality there is 
often no other method of measurement available than rating. 
Ratings are used to validate more objectiye testing method^. The 
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rating rlata that accumulate in offices, industrial plants, and school 
systems may yield valuable information in research as to the 
importance of certain personal qualities m work of various kinds 
Any attempt to develop objective testing methods for the measure- 
ment of personnel has to depend upon ratings for its validation 

Administration of Ratings 

Ratings should be given periodically and systematically. Most 
rating systems to-day are carefully planned and provide that 
records be entered on a piepaied chart \'ciy frequently a graphic 
chart (described later) is used In introducing ratings on a large 
scale, plans should be laid for a uniform and standardized system. 
Simply to ask supervisors and managers for an offhand expres- 
sion of opinion IS ceitain to result in chaos, and the information 
they will turn in will have all of the defects of the most sub- 
j'ective evidence In order to make ratings as accurate, objective, 
and free from bias as possible, the conditions under which they 
are made should be ihoioughly standaidizcd Ratings should be 
made on caids of unifoim si/e, for convenient sorting and filing. 
Unifoimly punted laling blanks mil make possible a comparison 
between diffeicnt latings and so enable the statistician to compile, 
summarize, and interpret the lating results of various groups at 
different times 

Uniform and standardized blanks should be used in rating. 
If uniform and standaidized blanks are prepared, and if some 
graphic scheme of marking is adopted, the rating is easy These 
are the important factois to be considered Every rating repre- 
sents an act of judgment, weighing, and comparison Usually rat- 
ing requires the making of fine distinctions on slender evidence, 
hence any device that can reduce the difficulty of the task is of 
value Exccutu es find the modern standardized graphic rating 
blanks easy to use. Raters, freed from thinking up things to say 
and searching for descriptive adjectives, can concentrate directly 
on the rating. 

The best results aic obtained if ratings are made periodically, 
either annually, semi-annually, or quarterly They cannot be made 
too often, at least so it seems if tve overlook the piactical objec- 
tions that tune is rcquiicd to make the judgments and that most 
people feel a natural reluctance to making judgments of this kind. 
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Of course in administering the system a definite date should be 
set on which the ratings aie due 

Ratings should be made to yield quantitative scores. Although 
the use of descriptive adjectives such as excellent, good, average, 
fair, or poor smacks of subjective evaluation, it is probably true 
that a rating in terms of a number scale is no more objective 
than one in terms of descriptive adjectives Indeed, the graphic 
scale does away with the use of numbers and permits the rater 
to think entirely in terms of descriptive adjectives, a far more 
familiar form of thought But whether the rating is given in nu- 
merical terms or whether dcsciiptive adjectives are translated 
into numbers, numbers are necessary if the ratings are to be given 
statistical treatment Although for the more immediate uses of 
rating in personnel woik statistical treatment is not necessary, 
whenever ratings are gathered with a possibility that they may 
be used in a research way they should be quantitative A numeri- 
cal rating scale makes it possible to check up on the accuracy 
of the raters. 

Ratings, for use either in personnel work or in research, should 
be in the form of an obj'ective record. All scientific woik demands 
data that may be studied and worked over In the natural sciences 
one may study actual matciials or organisms The chemist works 
with elements and compounds, and the biologist studies tissues 
under the microscope And the psychologist, whose material Is the 
transitory reaction, seeks for data in no less objective foim He is 
fortunate when the reaction leaves behind it a permanent record, 
as when a subject takes a paper and pencil test At other times 
he IS forced to make special arrangements to obtain a permanent 
record of the reaction made A rating is a permanent, objective 
record of one’s observation of conduct The value of a rating is 
diminished only by the subjectivity of the obseivation which it 
records In its use in peisonncl administration, an objective record 
of the rating has othci values, such as the possibility of compari- 
son with other records of the same individual obtained at other 
times and under other conditions; or by getting cumulative rat- 
ings, successive ratings may be compared with each other, growth 
may be noted, and progress may be determined. If ratings are 
worth making at all, they are worth keeping. 

Besides the original rating sheets, which should always be filed 
away and kept, there may be several types of transcripts. In a 



46 Diagnosing Personality and Conduct 

large organization a checking sheet should contain data as to the 
tune when ratings aie turned in, objective measuies of the ac- 
curacy of the rating and notes as to the care and value of the 
ratings of each of the raters Another sheet may contain a sum- 
mary of the ratings of each employee so ai ranged as to show 
the compaiison of ratings on any individual and the growth or 
change o\er a period of time Ratings also should be entered on 
each employee’s personnel lecord card, and here again cumulative 
records may indicate pi ogress and enhanced value of the man 
to the company 

Observation and note-taking preliminary to rating. The best 
results m rating come fiom taking pains One way to get poor rat- 
ings is to ask a peison to sit down and give in an offhand, oppor- 
tunistic way his opinion Ratings, in order to be of value, must 
be made with more than ordinaiy caie In the fiist place the rater 
must be brought wholly into sympathy with his task His attitude 
must not be that of passive acquiescence in the task of giving 
latmgs, rather, he must take an active interest in assuring himself 
that his judgment is woilh seeking, is merited. 

Good rating demands a prcliminar>’ period of observation Dur- 
ing this obscivation the rater should be instructed to watch for 
certain definite things. Usually in latmg persons the observa- 
tion should be directed to a man’s acts and his conduct Even if 
the rating is to be on a man’s qualities 01 tiaits, those qualities 
or traits should be analyzed into then behavior components, and 
these acts of conduct should be obsei\ed to aid in forming a judg- 
ment Webb (i2o) recommends that the observation period 
should be extended not meiely over houis or days, but over weeks 
and months An hour’s observation is certain to result in a nar- 
row sampling of a man’s conduct, and such ratings as are based on 
limited observations are sure to be waipcd Webb also notes that 
the observations of diffeient raters must be independent Raters 
should not compare notes before giving their ratings, but wait 
until after their judgments are recorded If the observations are 
not Independent, the separate judgments will not count as single 
judgments, for one judgment will influence another. Notes, which 
are descriptions of actual conduct, should be taken during the 
period of observation, and as far as possible the rating should be 
based on the objective records which the notes contain Good 
notes contain a minimum of generalization, inference, or opinion. 
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Before giving the rating the rater should dispassionately review 
the evidence contained in his notes based on obseivation He 
should eliminate as fai as possible pcisonal bias, his likes and dis- 
likes, common gossip, and leputation The ideal of lating is a pure 
act of judgment based on whatever evidence he is able to gather. 
Thus by making long, careful obseivation, lecording these obser- 
vations when made, and basing judgment on what is observed, 
ratings are brought to their highest reliability. Quite often one is 
asked to rate on qualities oi traits which one has never even had 
the opportunity to obscive Again one may be asked to rate on 
qualities or traits to which he has never given his attention Only 
too fiequently we aie asked for facts for which the evidence has 
never been collected 

The training of raters. This is an obvious sequel to the need 
for observation m older to obtain good ratings It is com- 
monly yet erroneously assumed that any one can give ratings. 
The best results come after the raters have geen given in- 
structions in the art of lating and, as pieliminary to this, in 
the art of observing and note-taking They also need instruc- 
tion on how to use the scale, how the scale is maikcd, how to 
give ratings that yield an unskewed distribution, and how to 
avoid the pitfall of allowing the geneial impression of a per- 
son to influence the rating on a specific trait Raters need 
practice in rating and criticism of their ratings by a competent 
person. 

Kingsbury (6i) reports the results of experience in the train- 
ing of raters in a large bank He pioposes the following procedure. 
First, there should be instruction in the purposes of rating and 
in the use of the particular rating scale adopted Second, the rater 
should make a set of ratings Thud, these ratings should be 
analyzed by the psychologist Fourth, the psychologist and rater 
should confer. Fifth, there should be reinstiuction in which the 
particular faults and weaknesses of the rating are pointed out. 
Sixth, there should be rerating Kingsbury states that this pro- 
cedure was repeated three times before the raters were given a 
certificate of competence. Only after similar preparation can satis- 
factory ratings be obtained. 

This particular institution prepared a rater’s manual which was 
used m giving the instruction in rating. This manual contained the 
following_topics. 
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1. Statement of the purposes of rating. 

2. Exposition of the lesponsibility of raters. 

3. Detailed description of rating scales 

4. Description of the piocedure to be followed in making ratings. 

5. Enumeration of common errors to be guarded against in 
rating. 

6. Explanation of some of the methods used in analyzing ratings 
which it IS suggested the manager may use m checking his 
own ratings. 

The two main checks that can be readily and objectively ob- 
tained on a rating are average error and direction of error. 


Table 3 

Computation for Determining Average Error and Direction of 
Error in Rating 


Individuals 

Average 

Hating 

Differences 

Positive 

Negative 

rated 

rating 

being 

smthout 

differences 

differences 



studied 

signs 



A 

6S 

6 0 

OS 


—OS 

B 

2 6 

40 

14 

14 


C 

79 

70 

09 


—09 

D 

3 * 

30 

02 


— 02 

E 

08 

30 

22 

22 


F 

45 

50 

OS 

OS 


G 

67 

40 

27 


—27 

H 

84 

90 

06 

06 


I 

80 

8 0 

00 



J 

2 2 

30 

08 

08 





98 

SS 

—43 




9 8 — 10 = 98 




Average error in rating .. 098 

Rating too high by an average amount of . o iz 

The average rating is the average of the several supervisor’s ratings on the 
individuals being rated 


The steps in the procedure are as follows: 

1. Place the ratings of the supervisor who is being studied 
opposite the “true ratings,” i.e., the ave.age rating of all the 
supervisors as in the above example. ' 

2. Take the difference without regard to sign and place in the 
fourth column. 

3 Find the sum of this column. 

4. Divide this sum by the number of individuals rated. This 
is the average error in rating. 
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5. If the rating is higher than the true (average) rating, place 
the difference in the column headed Positive differences Place a 
plus sign before each of these differences If the rating Is less 
than the true (average) rating, place the difference in the column 
headed Negative differences. Place a minus sign before each of 
these differences. 

6. Add each of these columns: Call the sum of the Positive 
differences column plus, and the sum of the Negative differences 
column minus. 

7. Subtract the sum of these two columns. Give the difference 
the sign of the larger number. 

8. Divide by the number of ratings made If the rating is the 
same as the true rating, the difference is zero and will not belong 
in either the positive or the negative column. One divides by the 
total number of ratings in any case If the result is plus, the 
rating has been too high on the average, if the result is minus, 
the rating has been too low. 

Schools are cautiously trying out ratings for the purpose of get- 
ting a supplementary mark in the shape of a measure of citizen- 
ship or other character traits. Scattered bits of evidence would 
indicate that there are immense possibilities of value in this 
rating, although — except in one or two instances — this rating has 
hitherto never been undertaken seriously. Where latings have 
been discredited, the reason is partly that they have never been 
given a thorough trj^-out. It has been taken for granted that any 
one can make a character rating Yet rating the quality of an 
English composition, 01 of a specimen of handwriting, is a skilled 
art which is mastered only after considerable practice Schools 
should not undertake more extensive ratings until they have 
pondered over the nature of the problem they are up against. 
Standardized blanks or record cards should be drawn up and 
printed on which to record teacher ratings of pupils, these ratings 
to be called for periodically. Teachers must be encouraged to 
make systematic observation and must be requested to make 
notes of the results of their observation Ratings which have been 
made should be studied statistically to determine average error 
and direction of error of the ratings Then teachers should be 
called into conference and given definite instructions as to how 
their ratings may be improved An upper limit of error may 
be established and this diagnosis and instruction of the teachers 
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as raters should continue until all attain a proficiency within the 
limit of error allowed If ratings are worth while at all, they are 
worth doing well. 

Traits as a Scale of Measurement 

, Because we usually judge a person in qualitative terms, it comes 
as a distinct surprise to some of us to learn that it is possible to 
rate another person quantitatively In the old days (which have 
not altogether gone'), the value of a man’s services was reported 
by the use of descriptive adjectives Some of the qualities ascribed 
to Colonel Lindbergh in promotion leports in the army as cited 
by the President were “purposeful,” “serious,” “deliberate,” 
“stable,” “efficient,” “industiious ” But even so remarkable a 
character as Lindbergh probably does not always show delibera- 
tion or industry to the same degree at all times or in all situa- 
tions. Descriptive language rests almost wholly on an all-or-none 
basis, while conduct itself is not dichotomous (serious-frivolous, 
good-bad, etc ) Characterization is lough and crude when it is 
accomplished by means of descriptive adjectives. 

In assigning ratings one comes to think of a trait as possessing 
degrees It is as though a scale weie constructed different points 
along which would represent diffeient degrees of the trait. Even 
though theie is no objective means for defining points on the 
scale, and descriptive adjectives must still be resorted to, the gam 
comes in recognizing that the trait may be present or absent in 
varying degrees Suppose that the rater is giving his judgment 
as to the honesty of an individual He does not pronounce him 
merely honest or dishonest, but while thinking of a scale running 
from the utmost dishonesty through varying degrees of less than 
perfect rectitude to peifect uprightness, he tries to place the indi- 
vidual somewhere along that scale, according to his judgment of 
the facts. In making such a judgment a rater tries to think of the 
average status of the person being rated in the trait under con- 
sideration, having in mind no special instaiice when the individual 
exhibited the trait in question. 

It will be found that most personnel ratings are on traits rather 
than conduct habits, probably because traits are more compre- 
hensive and general We feel in rating traits that we are rating 
more fundamental and important characteristics of a person than 
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would be the case m rating merely on behavior or conduct How- 
ever, the reality of what is designated by the name of a trait has 
often been questioned Is an individual alwa3's e.xacth* honest in 
entering his golf score or in stating his son's age m buxnng a rail- 
road ticket" If one stops to think just how the rating on some 
trait IS made, only vague lecollections are biought to mind Wells 
(121, p 15) tells us that judgments foi which we can give no 
reason are often as good as those for which we can give a rea- 
son Landis (72) has asked laters on what the\" based their 
judgments, with the inteiesting results that half of the reasons 
given for judgments weic vague and non-specific, evidentlj" made 
on vague, fleeting impressions — ^piobablj' the residue of forgotten 
experiences. Landis found that when foiced to justify a rating, 
raters give the most bizaire leasons He also found that specific 
reasons were most often given when the rating implied some dis- 
tinctlj' undesirable or unsocial tiait In other words the typical 
rating is not the result of a compilation of a senes of clear, definite 
observations, but is a general impicssion based on a variety of 
half-forgotten contacts and expeiiences Scanty evidence at best 
is all that one can recall when asked concerning a person’s 
honesty, perhaps it is an instance of laxity on the part of a 
storekeeper in returning some overcharge that is remembered, 
or perhaps the misiepresentation of an event in a friend’s 
account of it. In short, unless certain events stand out clearly 
in memory, one tends to class all people as about even in rec- 
titude and trustworthiness To give an adequate rating one 
should have noticed the individual carefully in many situations 
where the trait might be exhibited. For instance, if it is a ques- 
tion of honesty, one might observe and give answers to such 
questions as: 

1. Does he return borrowed money promptly? 

2. Does he return money when he receives too much in change? 

3. In traveling does he pay his fare without any attempt to 
dodge ^ 

4. Does he return found articles? 

5 . Does he keep a promise? 

6. Does he obey rules? 

7. Does he play games honestly? 

8. Does he refrain from cheating in an examination? 

9. Does he recite without attempting to bluff? 

10. Does he do his work independently ? 
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In shoit, if what is called conduct rating, as opposed to trait 
rating, is attempted, instead of judging tiaiLs one may rate the 
expected i espouse in a particular situation. The answers to the 
above questions might be simply yes oi 110. But even a single 
and limited act of conduct may icceivc a more detailed rating, 
perhaps in tcims of the different piobabilities of such conduct, 
using adjectives such as never, seldom, sometimes, often, always. 
There is always a probability, langing fiom zero to one, that a 
certain act will take place Sometimes the act will take place only 
once out of 100 possible times, sometimes ten out of 100, some- 
times ninety out of 100; and these chances can be estimated in 
rating. 

Tlieie is undoubtedly a relationship between the rating of traits 
and the rating of conduct The clement in behavior is a reaction 
in a certain situation, and our primary judgments are always 
based on our observations of actual conduct. Just how far one’s 
judgment of a tiait is based on the observation and judgment 
of conduct It IS impossible to state Fuither, it remains for experi- 
mentation to prove that the average of the judgments on the above 
ten questions on honesty will coi relate perfectly with the rating 
of the individual on honesty in general. Although close, they are 
probably never identical Of the two, we should prefer as a meas- 
ure of honesty the summation of answcis to the ten questions, 
rather than a rating on honesty itself For we do know that the 
answers to the ten questions represent a fair sampling of the 
field of honesty, whereas a blanket latmg may rest on evidence 
derived from two or three scattered and partial observations. 

General Methods of Making Ratings 

Rating methods are almost as old as any experimental methods 
in psychology The beginnings of rating methods can be traced 
back to the work of Fechner and Mantegazza in psychophysics. 
The first rating scale, in its modern sense, was that published by 
Gallon in 1883 in a section on “Mental Imagery” in his Inquiry 
into Human Faculty and Its Development* (pp. 64, 65). This 
scale even to-day may be considered a model of its kind. 

• E P. Dutton h Co , Inc. 
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Scale oj Mental Imagery — Imagery of a Breakfast Table 

“Highest — Brilliant, distinct, never blotchy. 

“First Suboctile — ^I’he image once seen is perfectly clear and 
bright 

“First Octile — I can see my breakfast table or any equally fa- 
miliar thing with my mind’s eye quite as well in all particulars 
as I can if the reality is before me. 

“First Quartile — Fairly clear, illumination of actual scene is fairly 
represented Well defined Parts do not obtrude themselves, but 
attention has to be directed to different points in succession to 
call up the whole 

“Middlemost — Faiily clear. Brightness probably at least from one 
half to two thirds of the oiiginal Definition varies much, one 
or two objects being much moie distinct than the others, but the 
latter come out clcaily if attention be paid to them 

“Last Quartile. — Dim, not compaiable to the actual scene I have 
to think separately of the several things on the table to bring 
them clearly before the mind’s eye, and when I think of some 
things the othets fade away in confusion. 

"Last Octile — Dim and not compaiable in brightness to the real 
scene Badly defined, with blotches of light; very incomplete; 
very little of one object is seen at one time. 

“Last Suboctile — I am very raiely able to recall any obj’ect what- 
ever with any sort of distinctness Very occasionally an object 
or image will recall itself, but even then it is moic like a gen- 
eralized Image than an individual one. I seem to be almost 
destitute of visualizing powers as under control. 

“Lowest — My powers are zeio To my consciousness there is al- 
most no association of memoiy with objective visual impres- 
sions. I recollect the table but do not see it ” 

Another famous scale is that set up by Pearson (85) for judg- 
ing intelligence. 


Pearson’s Scale of Ability 

A Mentally defective Capable of holding in the mind only the 
simplest facts, and incapable of perceiving or reasoning about 
relationships between facts. 

B. Slow dull. Capable of perceiving relationship between facts in 
some few fields with long and continuous effort, but not gen- 
erally nor without much assistance. 

C. Slow. Very slow in thought generally, but with time under- 
standing IS reached. 

D. Slow intelligent. Slow generally, although possibly more rapid 
in certain fields, quite sure of knowledge when once acquired. 
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E. Favly intelligent Ready to gtasp, and capable of peiceiving 
facts in most fields, capable of understanding without much 
effort 

F. Distinctly capable A mind quick in peiception and in reason- 
ing rightly about the peiceived 

G Very able. Quite exceptionally able intellectually, as evidenced 
either by the pei son’s career or by consensus of opinion of 
acquaintances, or by school record m case of children. 

In using a scale of this kind one uses a simple ruled sheet on 
which are listed the names of those for whom ratings are desired 
If the different levels of the scale are numbered the appropriate 
numbers may be enteied against each name to indicate the rater’s 
judgment. 

RiTINCS on InTELLIOENCE 

Smith . 3 

Jones . • . 3 

Brown . 4. 

Thompson . . . I 

Black . . ... .6 

Other devices for recording such ratings will suggest them- 
selves to the ingenious psychologist. It is always possible to let 
the process of rating be that of checking or underlining or cross- 
ing out instead of writing in numbers, as shown in the illustration 


Ratings on Intelligence 


Smith 
Jones . . 
Brown 
Thompson 
Black 


I * © 4 S 6 7 

...I a © 4 s 6 7 

...I z 3 (4) 5 6 7 

••© *34567 

• I * 3 4 5 ® 7 


As against these methods of rating, probably the simplest meth- 
ods which could be devised, it has been suggested that in rating 
men on any given tiait much longer descriptions of different posi- 
tions on the scale might be advantageously used. For example, 
each scale point could be illustiated by desciibing in more or less 
detail and in a vivid way actual individuals. The suggestion is 
prompted by the hypothesis that by thus more clearly defining 
each point on the scale the ratings could be given more objectively 
and hence more reliably. 

Another method of giving ratings is called the method of paired 
compansons In this method the rater compares each individual 
being rated with every other individual being rated with respect 
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to the trait under consideration The comparison is simply in 
terms of bettei 01 vioise A good scheme for using this method 
IS to draw up a luled table upon \\hich each comparison is indi- 
cated by placing a cioss in the appropriate square to designate 
superiority First compare individual A with the other individuals, 
placing a cross in the A row under B if A is judged superior to B, 
etc. The marked chait indicates how this scheme is carried out 



The table should be made consistent by seeing that there are 
no conflicting judgments If m the upper light section A is judged 
supeiior to E, then E must not be judged superior to A in the 
lower left section Finally the numbei of crosses in each row may 
be summed acioss for a composite valuation 

Another method in common use is called the ordei-of-ment 
method, or more simply ranking In tins method the persons being 
rated aie assigned seiial numbers fiom i up The most superior 
individual is given l, the next individual is given 2, etc If two 
individuals aie judged to be tied foi a given rank, they aie both 
given a rank which sliikes an average of their rank and the next 
rank following For instance, if two individuals tie for first place, 
both are assigned the rank of i 5, and the next individual is 
ranked 3 If thiee individuals tie foi fiist place, all thiee aie given 
the rank of 2, and the next individual is given the rank of 4 
Ranking is expedited by writing the names to be ranked on cards, 
which may be rearranged in the order desired. 

Still another method of rating is the score card. Whatever is be- 
ing rated, whether it be the quality of school buildings, or the 
efficiency of teachers, or the merit of textbooks, is analyzed into 
constituent parts to each of which a maximum score is assigned. 
These allotments of maximum score are usually determined by 
asking various competent persons to analyze whatever is being 
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rated and to assign lelative weights according to their judgment 
These weights aie then aveiaged Usually the scoie caiJ is so 
constructed that the possible total of maximum ci edits is 100 
or 1,000 In using the scoie card the latcr who is judging a specific 
textbook 01 school building 01 teacher assigns a value to each 
item which is some pioportional part of the maximum credit al- 
lowed that Item in the scoie card These credits are then totaled 
for the final score. The difficulty in this method, which is also a 
limitation upon the adequacy of the method, is in assigning the 
credits to the separate items. 

The score caid given here for illustration aims to measure the 
meiit of church school textbooks. It was constructed by C. C 
Pctcis and is to be found in Volume II of the Indiana Survey of 
Religions Education (pp 109-114), Institute of Social and Reli- 
gious Research, W. S. Athearn, editoi. 


Score C^rd for Measuring the Merit of 
Church Schooi. Te.xtbooks 




POINTS 

POINTS 


Mam Headings 

Sub-Headings 

I 

Mechanical Features 

1 Type . ... 

ns 

26 


2 Attractiveness of page 


20 


3 Pictorial illiistialions 


28 


4 Organization of page 


21 


S Make-up of book or pamphlet 


20 

ir 

Style 

I General literary merit 

too 

4S 


2 Appropriateness of style to age of pupils 


55 

III 

Pedagogical Organization of Lessons 

I Organization of the lesson about an aim 

250 

S6 


2 Type of organization of the lesson 


41 


3 Provision for controlling study 

4 Provision of means to insure function- 


SO 


ing of the instruction 


65 


S Provision for the enrichment of ezperi- 




ence m ways not directly related to 




the lesson aim but not antagonistic to 




it 


38 

rV Teaching-Helps in the Individual Lesson. 

140 



1 A separate manual for teachers 

2 Valuable supplementary material for 


32 


teachers 


31 


3 Useful teaching suggestions 


38 


4 Valuable teaching aids 


39 
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POINTS POINTS 

Main Headings Snb-IIeadings 
V Teaching-Helps Involved in the Organiza- 
tion of the Book as a \\ hole 125 

I Valuable teaching suggestions additional 
to those that constitute an integral 
part of each lesson (as in introductory 
chapter or scattered m short notices 


through the book) . . 34 

2 Supplementary teaching material ... 38 

3 Provisions for giving the teacher per- 

spective of the course . . , *9 

4 Provision for review lessons .... 24 

VI Content 270 

1 Fitness of the material to appeal strongly 

to pupils of the age for which the 

lesson IS intended . 95 

2 Fitness of the material to meet the 

needs of the pupils as defined by child 
psychology and by sociology (age- 

levels considered) . . ' lio 

3 Fitness to meet the specific objectives 

of the particular church (or other 
group) for which tlie material has 
been prepared 65 


Man-to-Man Rating 

One of the significant developments of the rating scale technique 
during the World War was the man-to-man scale. The Army Rat- 
ing Scale, reproduced below, made use of this principle. It has 
often been urged that in rating one should not compare a man 
with abstract qualities defined on papei, but should make a direct 
comparison of one man with another Rating should not consist in 
the assignment of a man to one out of a set of compartments 
labeled excellent, good, average, pool, or bad, but should be a 
direct comparison of one peison with another The question asked 
should be, “Is Jones a better man than Smith in the trait being 
considered. 

The man-to-man rating principle was developed by Dr. Walter 
Dill Scott and his associates in the Bureau of Salesmanship Re- 
search of the Carnegie Institute of Technology. The Army Rating 
^ale was first framed by Scott in May, 1917. The Army Rating 
Scale which is reproduced below contains the five headings Phys- 
ical Qualities, Intelligence, Leadership, Personal Qualities, and 
General Value to the Service. For each quality an officer was to 
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select five men of Ins acquaintance, one possessing the quality 
in Its greatest possible degiec, one midway between this maximum 
and the average, one aveiage m the quality, one midway below 
the average and the lowest degiec and one possessing a minimum 
of the quality. These ‘‘scale men” constituted the scale by means 
of which all other men were laled by a direct comparison. 


How to Make the Scale * {q 2, pp. 20^-206) 

“1. Write on small slips of paper the names of from 12 to 25 
officers of youi own lank and not above the aveiage age of that 
rank They should be men with whom you have scived or with 
w'hom you are well acquainted Include officers whose qualifica- 
tions are extremely pool as w^ell as those who are highly efficient 
If these names do not include all the giades for each of the five 
qualifications, others may be added 

“2. Look over your names from the viewpoint of Physical 
Qualities only Disregaid eveiy other characteristic of each officer 
except the way in which he impresses his men by his physique, 
neatness, voice, energy, and endurance Arrange the names on the 
slips of paper in order from highest to lowest on the basis of the 
physical qualifies of the men Select that officer who surpasses all 
the otheis in this qualification and enter his name on the line 
marked Highest under Physical Qualities Then select the one who 
most conspicuously lacks the qualities and enter his name on the 
line marked Low'est Select the officei who seems about half way 
between the twm previously selected and W'ho represents about the 
general average in physical qualities, entei his name on the line 
marked Middle Select the officer who is half w^ay between Middle 
and Highest, enter his name on the line marked High Select the 
one who ranks half way between Middle and Lowest, enter his 
name on the line maiked Low 

“3 In the same manner make out scales for each of the other 
four qualifications (Intelligence, Leadership, Personal Qualities, 
and General Value to the Service). 

“4. Each officer w^hose name appears on the Scale should be 
one who exhibits clearly and distinctly the qualification and the 
degree of the qualification for which he has been chosen. 

“5. The names for Highest and Lowest on each section of the 
Scale must represent extreme cases. The name for the Middle 
should be that of an average officer, half way between extremes 
High and Low should be half way between the Middle and the 
extremes. An even gradation of merit is important. 

* Scott, VV D, and Clothier, R C, Perstmnel Management (A W Shaw 
Company, 1923), pp 203-206 By permission of the present publishers, McGraw- 
Hill Book Company. ' 
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“6 In making or using any section of the Scale, consider only 
the qualification it covers, totally disregarding all the others 
“7 In rating subordinates of more than one grade, the best 
practice is to make separate scales for each grade, using always 
the names of officeis one grade higher than that of the sub- 
ordinate to be rated However, in exceptional cases good results 
have been secured where a scale constiucted of captains is used 
for rating both lieutenants and captains, and a Scale constructed of 
colonels is used for rating all ranks of field officers. 

How to Use the Scale 

“8. Rate your subordinate for Physical Qualities first. Con- 
sider how he impresses his men by his physique, bearing, neat- 
ness, voice, energy, and endurance Compare him with each of the 
five officers in Section I of your Rating Scale, and give him the 
number of points following the name of the officer he most nearly 
equals If he falls between two officers in the Scale, give him a 
number accordingly (eg, if between Low and Middle, give him, 

7, 7y2, or 8) 

“9 Rate the subordinate in a coi responding manner for each 
of the other four essential qualifications Undei III (Leadership) 
and V (General Value to the Service), consider which officer he 
will most nearly equal after equivalent experience 

“10. In rating, make a man-to-man comparison of the sub- 
ordinate with the officers whose names appear on your Scale — 
never in terms of numbers dnectly Disregard the numerical 
equivalent until you have made these concrete comparisons 

“ii. When rating sevcial subordinates, rate all of them on 
each qualification before adding the total for any one. 

“12 This IS not a percentage system and you should not allow 
yourself to fix in mind any paiticular number of points you 
think the subordinate ought to get 

“13. The total rating for a subordinate is the sum of the ratings 
you give him in the five separate qualities If directions are fol- 
lowed carefully, the average of any considerable group of officers 
rated is about 60 points In othei words, 60 points for a lieutenant 
means that a captain has compaied him with the captains he 
knows and certified that after equivalent experience he will be 
equal to an average captain 

“14. Each officer below the rank of Brigadier General will be 
rated by his immediate superior Ratings will be revised or ap- 
proved by the immediate superior of the officer making the rating. 
The revising officer will use his own scale and make ratings in- 
dependently of those made by the rating officer. Superior officers 
will see that their subordinates make all ratings according to the 
Rating Scale system, m order that a just and equitable record may 
be had for all officers in the Armv. 
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Army Rating Scale 


“l. PHYSICAL QUALITIES 


“Physique, bearing, neatness, voice,-- energy, and endurance. 
(Consider how he impresses his men in the above respects.) 


Highest 
High 
Middle 
Low . 
Lowest 



IS 

12 

9 

6 

3 


'll. INTELLIGENCE 


“Accuracy, ease in learning, ability to grasp quickly the point of 
view of commanding officer, to issue clear and intelligent orders, 
to estimate a new situation, and to arrive at a sensible decision m 
a crisis, 

Highest 
High 
Middle 
Low .. 

Lowest 


"III. LEADERSHIP 

“Initiative, force, self-reliance, decisiveness, tact, ability to in- 
spire men and to command their obedience, loyalty and coon- 
erfltinn. •' ^ ^ 


Ij 


Highest 

High 

Middle 

Low 

Lowest 


‘iV. PERSONAL QUALITIES 

“Industry, dependability, loyalty, readiness to shoulder re- 
sponsibility for his own acts, freedom from conceit and selfish- 
ness, readiness and ability to cooperate. 

Highest IS 

Middle 9 

Low 6 

Lowest 3 



*S 

12 

9 

€ 

3 
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“V GENERAL VALUE TO THE SERVICE 

“His professional knowledge, skill and experience; success as an 
administrator and instructor, ability to get results ” 

Highest *5 


High 12 

Middle 9 

Low . . Sccd'X.euL.tn ^ . 6 

Lowest 3 


The rating scale is a master scale like a yardstick or thermom- 
eter with which the men to be rated are directly compared. 

One difficulty with the use of the scale is that each man’s scale 
is his own and may have only slight relation to the scale of some 
one else rating the same individuals For instance, the average 
man on A’s scale may be distinctly superior to the average man on 
B’s scale Consequently the man-to-man method does not avoid 
errors due to tendencies to rate too high or too low. Paterson and 
Ruml (84) emphasize the necessity for calibration of the “scale 
men” when the man-to-man scale is used extensively A similar 
trouble develops because scale men do not necessarily represent 
equal distances. Cleeton and Knight (22) make the point that 
man-to-man rating tends to avoid overestimation because one is 
making a direct comparison of individuals Overestimation, how- 
ever, can be surmounted by defining the per cent or number of 
individuals to fall in each class, or it may be corrected after the 
ratings have been made. 

The man-to-man scale has not had extensive use since the 
World War. Concerning this Scott and Clothier (92, p. 207) say, 

“The army form of Rating Scale has been found generally inap- 
plicable to business and industrial conditions by virtue of the fact 
that It IS relatively cumbersome in use The construction of the 
master scale and the mental balancing of one man against another 
call for an expenditure of time and effoit which the average execu- 
tive is not in a position to contribute It was found that in addition 
to fulfilling the need for accuracy, the industrial rating scale must 
fulfil the need for ease of opeiation. The success of any rating 
procedure necessarny~depends upon the good-will and the intelli- 
gent cooperation of the executives and foremen under pressure of 
daily routine. It is very difficult to win this good-will and coopera- 
tion when the mechanical difficulties involved in the use of the scale 
are great.” * 

• Scott, W D , and Clothier, R C , Personnel Management (A W Shaw 
Company, 1923), pp 207-208. By permission of the present publishers, McGraw- 
Hill Book Company. 
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The Graphic Rating Scale 

The graphic rating method, to which reference has been made 
previously, is probably the most serviceable and widely adopted 
method of rating. The cross-on-a-line method has been in use 
for some time Boyce (ii) used this method prior to 1915, al- 
though his is not a true graphic rating plan, inasmuch as he 
divided his line up into intervals so that the rater merely crossed 
the line to indicate in which mteival his judgment fell The true 
graphic rating scale developed by the Scott Company Laboratory 
consists of a straight line about four or five inches in length be- 
neath which aie written desciiptive adjectives or phrases to help 
define points along the scale The line is intended to represent 
the scale of the quality or trait being lated, extending from one 
extreme of the quality to the other The scale is marked b> 
placing a cross on the line anywhere It must be emphasized in 
gn ing the diiections for using the giaphic scale that the cross may 
be placed anyuheie along the line, not necessarily above one of 
the descriptive adjectives defining the points along the scale 

The graphic rating scale may be scored or evaluated by means 
of a homemade rule or scale A piece of cardboard may be used 
Take a straight edge of the cardboard and mark off points along 
the edge to include a length equal to the length of the line of the 
rating scale This space on the cardboard may then be divided up 
into intervals As was aigued on pages 78 and 79, seven intervals 
are sufficient, though any number of intcivals may be used. To 
the wi Iter’s knowledge graphic lating scales have been constructed 
with thirteen, eighteen, twenty, and twenty-two intervals, and 
doubtless examples could be found with other numbers of intervals 


1 J I 2 I 3 l4l5|6|7|8|9l 

Directions Place the stencil so that the scale coincides 
with the graphic line Note in which division the check 
falls Enter this number m the column at the right of 
the line 


The directions written on the stencil indicate how it is to be 
used There may be an advantage in keeping the graphic prin- 
ciple of Classing a line, but the line may be pre-maiked into ^ejen 
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divisions so that the rater meiely has to place his cross in one or 
another division 

Two graphic rating charts employing the graphic principle are 
reproduced here 

Rating Chart A 

Pupil’s Name Date 

School ... . . Giade 

Rated by . . . 


Directions for Using the Rating Chait 

1 Let these ratings represent your own judgment Do not con- 
fer with any one m making them 

2 In each trait or characteristic named below compare this 
pupil with the average pupil of the same age. 

3 In rating for any particular trait disregard every other trait 
except that one Many ratings aie rendered valueless because the 
rater allows himself to be influenced by a generally favoiable or 
unfavorable impression which he has foimed of the person rated. 
Do not rate a pupil high on all traits simply because he is excep- 
tional in some Children are often veiy high in some traits and low 
m others 

4 Place a cross somewhere on the line running from “very 
high” to “very low” to indicate this child’s standing in each qual- 
ity You may place your cross at any point on the line It is not 
necessary to locate it at any of the division points or above any 
descriptive phrase. 

5 Do not study too long over any one child Give for each the 
best judgment you can, and go on to the next. 

6 Give a rating for every trait 

7. The ratings will be held strictly confidential. 

Health — Is he generally healthy and vigorous? 


Bad Poor Average Good Excellent 

Leadership — Does he take the lead in school affairs or does he 
follow others? 


Always follows Rather tends Average Rather tends Masteily, 
others to follow to be a leader not easily 

influenced 

The usual method with such a scale is to go through the 
traits for one individual and then turn tho sheet to the next 'in- 
dividual. Even though instructions are given to rate all individuals 
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on one trait befoie tinning to the next tiait, no diiect com- 
parisons can be made, since only one page is visible at a time. 
To make use of the close comparison which is demanded in 
ranking, rating scheme B is recommended 

Rating Chart B 

Rated by . . . Date 

School Grade 

Directions foi Using the Rating Chart 

I. Let these ratings represent your own judgment. Do not 
confer with any one in making them. 

2 In each trait or characteristic named below compare this 
pupil with the aveiage pupil of the same age 

3 In rating for any particular trait disregard every other trait 
except that one. Many ratings are rendered valueless because 
the rater allows himself to be influenced by a generally favorable 
or unfavorable impression which he has formed of the person 
rated Do not late a pupil high on all traits simply because he is 
exceptional in some Children are often very high in some traits 
and low in others 

4. Place a cross in one of the compartments running from “very 
high” to “veiy low” to indicate each child’s standing in this 
quality. 

5 Do not study too long over any one child Give for each child 
the best judgment you can, and go on to the next 

6 Give a rating for each child 

7 The ratings will be held strictly confidential. 

8 Try to let the percentages guide you as to the number of 
cresses to fill in each compartment. 


Trait Health 

Is he generally healthy or vigorous? 


Pupil 

4 % 

Very 

bad 

\ 11 % 
Bad 

21% 

Poor 

28 % 

Average 

21% 

Good 

11% 

Very 

good 

4 % 

Excel- 

lent 

Charles . . 








William 








George 








etc 









Scale B is generally to be preferred because it keeps the desir- 
able features of the graphic rating chart and also permits the 
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close comparison of ranking. To be suie, the ratings for any one 
individual are spread over several sheets, but, as in any group test, 
the ratings for the several tests for any individual may easily be 
brought together when the numerical values of the graphic rating 
are measuied. 

Freyd (32, pp 93, 94) lists the following ads^antages of the 
graphic rating method. 

“It is simple and easily grasped 

“It is interesting and requiies little motivation of the rater. 

“It IS quickly filled out. 

“It flees the rater from direct quantitative terms 
“It enables the rater, nevertheless, to make his discrimination 
as fine as he cares, although this discrimination is lost if a scoring 
stencil of only a few points is used [This advantage is some- 
what illusory inasmuch as discrimination soon leaches a natural 
limit ] 

“It IS universal, that is, no master scale is required as in the 
Army Rating Scale 

“The fineness of the scoiing method may be altered at will, 
yielding scores of from i to 5, or fiom i to too 
“It allows of comparable ratings without requiring each rater 
to know all the members of the group.” 

Freyd (32, pp 99, 100) gives a list of rules for making the 
graphic rating scale Some of them, already mentioned in the 
earlier part of this chapter, will not be repeated Rules which 
apply paiticularly to the graphic rating scale are. 

“Decide on the extremes of the trait It is frequently the case 
that one extreme of a scale may have several opposites 

“It will be found good practice to introduce every scale with a 
question, to which the rating furnishes the answer, for instance, 
the question ‘How tactful is he^’ oi ‘Is he tactful or tactless?’ may 
be answeied by checking on the rating line 

“The rating line should be of such a length that a stencil for 
scoring the rating can easily be calibiated 

“There should be no breaks or divisions in the line 
“The line should not be much more than five inches m length, 
so that It may be grasped as a unit 

“There should not be more than five descriptive items nor less 
than three 

“The end phrases of the scale should not be so extremely worded 
as never to be employed 

“The phrase descriptive of the neutral or average degree of 
the trait should be in the center of the scale. 
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“If theie are five items, the intermediate ones should be closer 
in meaning to the center one than to the extremes This has the 
effect of spreading the distribution The same end may be accom- 
plished by making the mteivals on the scoring stencil smaller in 
the center than at the ends. 

“Only universally understood phrases should be used Slang 
is effective if there is no doubt as to its meaning 

“The desciiptive phiases should be set in small type, and there 
should be plenty of white space between them 
“The f a\ orable extremes of the scales should be alternated so as 
to do away with a tendency to check at one margin of the page.” 


The graphic rating scale which follows is taken from Filer and 
O’Rourke (31, p. 519) 

Graphic Rating Scale 

I It is requested that you indicate by check ( V ) your opinion of the 
.ipplicant m each of the qualities specified Place only one check after each 
quality For example, on the specimen scale below, the check mark indicates 
that the supposed applicant is in the class which “Learns and adapts slowly” but 
I' moic nearly average than dull, because the check is placed nearer the 
“aveage” group than the “dull ” 


tUILITY TO LE \RN 

Consider ease and 
lapidity of under- 
standing new in- 
structions and 
. dapting to new 
situations 


SPECIMEN — DO NOT MARK HERE 


V 


Dull and 

Learns and 

Average in 

Learns and 

Learns 

unadapta- 

adapts 

learning 

adapts 

with ex- 

ble 

slowly 

and adapt- 

readily 

ceptional 



mg 


ease and 





rapidity. 


Answer All of the Following 

In giving your opinion on a particular trait, disregard for the moment every 
trait but that one, as specifically defined, and consider the applicant’s ability 
in this trait from the point of view of GENERAL CLERICAL work only 

(a) ABILITT TO 
I EARN Consider 


ease and rapidity 
of understanding 

Dull and 

Learns and 

Average 

Learns and 

Learns 

new instructions 

unadapta- 

adapts 

learning 

adapts 

with ex- 

and adapting new 

ble 

slowly 

and adapt- 

readily 

ceptional 

situations 



mg 


case and 
rapidity 
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(b) INDUSTRY 

Consider energy 
and application to 
duties day ui and 
day out 


Lazy 

Indifferent 

Average 

application 

Industri- 

ous 

Unusually 

energetic 

(c) INITIATIVE 

Consider ability to 
go ahead with 
work without be- 
ing told every de- 
tail and to make 
practical sugges- 
tions for doing 
work in a better 
way 


Needs con- 
stant di- 
rection 

No origi- 
nality 

hlinor con- 
structive 
ability 

Considera- 
ble con- 
structive 
ability 

Highly con- 
structive 

(d) COOPERATIVE- 
NESS Consider abil- 
ity to maintain 
good working re- 
lations with co- 
workers 


T rouble- 
maker 

Causes 
slight fric- 
tion 

Indiffer- 

ent 

Coopera- 

tive 

Exception- 
ally co- 
operative 

(e) ATTITUDE TO- 
WARD WORK Con- 
sider voluntary in- 
terest and effort in 
work 


Uncon- 
cerned and 
no volun- 
tary effort 

Interest 
and effort 
below 
average 

At erage 
interest 
and effort 

Interest 
and effort 
above 
average 

Shows keen 
interest 
and whole- 
hearted 
effort 

(f) SPEED Con- 
sider rate at which 
applicant is able to 
work 


Very slow 

Slow 

Average 

rate 

Fast 

Exception- 
ally rapid 

(g) A C C URACY 

Consider ability to 
do work without 
errors. 


Unsatis- 

factory 

Makes 

many 

errors 

Average 

accuracy 

Seldom 

makes 

errors 

Exception- 
ally accu- 
rate 

(h) DISPOSITION 

Consider natural 
temper of mind 


Decidedly 

ill-natured, 

uncivil 

Easily 

vexed, 

moody 

Average 

sclf-rc- 

strainl 

Rarely 

vexed 

Excep- 
tional self- 
control 
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(1) NEATNESS 

Consider orderli- 
ness in work. 


Disorderly. 

Soniewhat 
below 
average m 
orderliness 

Average 

orderliness 

Somewhat 
above av- 
erage or- 
derliness 

Exception- 
ally or- 
derly. 

(j) ABILITY TO 

SUPERVISE Con- 

sider ability to di- 
rect ork of others 
effectively. 


Unable to 
direct 
work of 
others 

Somewhat 
below av- 
eiagc abil- 
ity in di- 
recting 
othe«‘S 

Shows av- 
erage abil- 
ity in di- 
recting 
others 

Somewhat 
above av- 
erage abil- 
ity in di- 
recting 
others 

Maintains 
loyal and 
cflcctive 
working 
force 


In their “Behavior Pioblem Record,” the giaphlc rating scale 
has been adapted by Haggerty-Olson-Wickman * for US“ m record- 
ing judgments directly as to the frequency of certain types of 
behavior. 

SCHEDULE A BEHAVIOR PROBLEM RECORD** 
Directions for Using 
Schedule A 

Below IS a list of behavior problems sometmic' found in children Put a 
cross (X) in the appiopnatc column after each item to designate how fre- 
quently such behavior has occurred m yovr expenenee with this child A cross 
should appear in some column after each item The numbers are to be disre- 
garded in making your record They are for use in scoring 


BEHAVIOR PROBLEMS 

TREQUENCY OF OCCURRENCE 

SCORE 

Has 

never 

occurred 

Has oc- 
curred 
once or 
tzvice bzit 
no more 

Occa- 

sional 

occur- 

rence 

Frequent 

occur- 

rence 

Disinterest m school work 

0 

4 

6 

7 


Cheating 

0 

4 

6 

7 


Unnecessary tardiness 

0 

4 

6 

7 


Lying 

0 

4 

6 

7 


Defiance to discipline 

0 

4 

6 

7 



(Etc for fifteen items) Total Score- 


Olson, W C, Problem Tendencies m Children (The University of Minne- 
sota Picss, 1930) 

Haggerty, ME ‘ H ; Incidence of Undcsiiablc Behavior in Public School 
Children,” Jownnl of Educational Research, 12 102-122 (Sept ip^S) 

''From llaggtiij, Olson and Wickman's Uaggeity-Olson-n'iikman Behavior 
Rating Schedules Cop> light, 1930, by the Amciican Council on Education and 
published by W’oild Book Company, Yonkcis-on-Hiidson, New York 
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SCHEDULE B BEHAVIOR RilTING SCALE* 

Directions tor Using 
Schedule B 

1 Do not consult any one m making your judgments 

2 In rating a person on a paitieulai trait, disregard etery other trait but 
that one Many ratings are rendered valueless because the rater allows himself 
to be influenced by a general favorable or unfa\orable impression that he has 
foinied of the person 

3 When you haie satisfied yourself as to the standing of this person in the 
trait on which you are rating him indicate j'our rating by placing a cross ( X ) 
immediately aboie the most appiopriate dcscriptne phrase 

4 If you arc rating a child, try to make your ratings by comparing him 
with children of his own age 

5 The masculine pronoun (he) has been used throughout for convenience 
It applies whether the person whom you are rating is male or female 

6 In making your ratings, disregard the small numbers w'hich appear below 
the descriptive phiascs They arc foi use in scoring 

nivisiON I 

1. How intelligent is he? Score 


Feeble- 

Dull 

Equnl ol avetage 

Bright 

Brilliant 

• 

minded 


child on street 



(s) 

(4) 

(3) 

(2) 

(I) 



2 Is he ahstiaeted 01 wide awake? 


Continually 

Fi cqucutly 

Usually 

Wide- 

Keenly 

absoibed m 

becomes 

pi esent 

awake 

alive and - - 

himself 

abstrncted 

minded 


alert 

(5) 

(4) 

(2) 

(1) 

t3) 

3 Is his attention sustained? 




Disti acted Jumps 

DifhcuU to 

1 

Attends 

Is absorbed 

Able to hold 

rapidly from one 

keep at task 

adequately 

in what be 

attention for ■ - ■ 

thing to another 

until completed 


does 

long periods 

(5) 

(4) 

(3) 

to 

(2) 

4. Is he slow or quick in thinking? 



Extremely 

SluJcisli. 

Thinks with 

Agile- 

Exceedingly 

slow 

Plodding ordinary speed 

minded 

rapid ■ 

(5) 

(4) 

(2) 

to 

(3) 

5 Is he slovenly or careful in his thinking 

7 


Very slovenly 

Inexact, 

Idoderately 

Consistent 

Precise 

and illogical 

A dabbler 

careful 

and logical 


(5) 

(4) 

(2) 

to 

(3) 


(Etc 

for thirty five 

Items ) 



Total, Dtmston 1 

•From Haggerty, Olson, and Wickman’s Haggerty-Olson-Wickman Behavior 
Rating Schedules Copyright 1930 by the American Council on Education and 
published by World Book Company, Yonhers-on-Hddson, New York, 
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Anothei sample of the graphic rating scale is the “Personal- 
ity Rating Scale’' shown on pages 70 and 71, devised by a com- 
mittee of the Aineiican Council on Education for use in colleges 
This scale iccogni7cs many of the safeguards and cautions which 
Its makeis had disco\eicd in their experience with ratings, among 
them being “Only tiaits observed by the rater should be rated, 
only traits foi which no valid objective measurements are now 
available should be rated, the number of traits to be lated should 
not exceed five, if teacheis are to be expected to rate the traits 
of a laige number of students, traits should be mutually exclu- 
sive, a tiait should not involve unrelated modes of behavior” 
Careful study of a large number of rating scales actually in use 
lesiilted in the selection of five qualities to be included in the scale 
which may be named as follows Personal Charm, Initiative, 
Leadership, Emotional Control, Responsibility. 

The Check List 

This device, originally found promising by Hepner (44), was de- 
veloped by Hartshorne and May (40) with considerable success 
One hundred and sixty words descriptive of approval or disap- 
proval weie employed, and were divided roughly into two lists of 
eighty words each For each w'ord in one list its antonym appeared 
in the othei list, but both positive and negative words appeared 
on both lists Samples of the words are as follows 

Brutal Undiscerning 

Resourceful Shirker 

Courteous Virtuous 

Frivolous Dignified 

etc. 

A teacher in rating a pupil reads through the list and places a 
check mark against each word which can honestly be applied to 
the person in question The directions require that the teacher con- 
sider each word, checking as many as she wishes to, or checking 
none at all if none apply. In scoring, the number of negative 
woids checked is subtracted from the number of positive words 
checked, and the resulting score comes out either -f- 1, o, or — i 
according as there is a balance of positive or negative words 

' Robertson, D A , chairman of Committee on Personality Measurements, 
Pcisonncl Methods,” The ^Educational Record, Supplement 9 no 8, July, 1928, 
P S4 
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checked By a scoiing method adopted later, the words in the lists 
aic weighted cmpiiically and more detailed scuies aie used The 
reliability of the technique is reported by Haitshoine and May as 
88 for total leputation, 74 when used to latc scivice, 64 when 
used to rate persistence, and 76 when used to rate inhibition. 

The “Guess Who” Test 

Haitshorne and May have devised a most promising technique, 
utilizing to good advantage a fact which is to be bi ought out later 
in the discussion of reliability, namely, that extreme ratings are 
especially reliable The diiections foi using this device, called the 
“Guess Who Test,” are as follows (40, p 87) 

“Here are some little word-pictures of children you may know. 
Read each statement carefully and see if you can guess who it is 
about It might be about yourself Theie may be more than one 
picture for the same person Several boys or giils may fit one pic- 
ture Read each statement Think over your classmates, and write 
after each statement the names of any boys or girls who may fit 
It If the picture does not seem to fit any one in your class, put 
down no names, but go on to the next statement Work carefully, 
and use youi judgment 

“i. Here is the class athlete He (or she) can play baseball, 
basketball, and tennis, can swim as well as any and is a good 
spoit ” 

(2 , etc ) 

Several methods of scoring the device weie tried, both by Hart- 
shorne and May, and later by the writer The method which 
seemed to yield the best results was the simple one of merely 
summing the total number of times a pupil is mentioned for any 
of the descriptions, giving “good” items a positive value and “bad” 
Items a negative value It might be expected that some one pupil 
m a class would be recognized by so many classmates as fitting 
a description that he would receive a preponderance of the votes, 
but actual experience shows that many children are usually recog- 
nized for a given description It was also found that a child who 
received a large number of votes on any one question tended also 
to be mentioned for a large number of items The results yielded 
a symmetrical distribution even though slightly peaked Hart- 
shorne and May report a reliability of 95 on the total test and .88 
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when measuring service. The writer 
found a reliability of 88 in his use of 
this device 

Sclf-Oidinaiy-Ideal Rating 

The tendcnc)f to overrate oneself on 
desirable traits (discussed in greater de- 
tail on page 109) suggested to Knight 
and Franzen (66) a method of measur- 
ing laiious phases of personal malad 
justment. G B. Watson expeiimented 
further with the method, and finally 
Sweet (100) made a comprehensive 
analj'sis of its possibilities His studies 
have resulted in the “Peisonal Attitudes 
Test” A better name to distinguish the 
method might be “Self-Ordinary-Ideal 
Rating,” although in use with boys as 
subjects this name would lack the merit 
of disguising the nature of the test pos- 
sessed by Sweet’s title. 

Many methods for recording the items 
could be devised The first item is 
marked as one boy might have marked 
it In Sweet’s test the items were printed 
as they appear on this page ( 100, p 6) 

The test is scored in seven different 
ways to yield scores that are called 
“Self-Ciiticism,” “Criticism of the Aver- 
age Boy,” “Feeling of Difference from 
the Average Boy,” “Feeling of Superior- 
ity,” “Feeling of Inferiority,” “Devia- 
tion from Accepted Ideas of Right,” 
and “Social Insight.” In order to make 
the explanation of these scores as simple 
as possible, ratings on “How I Feel” 
will be called “Self”; ratings on “How 
Most Boys Feel” will be called “Ordi- 
nary”; and ratings on “How I Think I 
Ought to Feel,” “Ideal.” 
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Scores for Self-Cnticism are found by counting the number of 
times “Ideal” is different from “Self” The higher the score, the 
more the boy tends to depreciate himself; the lower the score, the 
more peifect the boy says he is. 

Cnticism of the Average Boy is found by counting the number 
of times “Ordinary” differs from “Ideal” When this score iyhigh, 
the individual rates the group below the ideal frequently and 
hence tends to criticize it. When the score is low, the averse boy 
IS rated as being close to the ideal 1' 

Feeling of Difference is found by counting the numbci^liof times 
“Self” is marked differently from “Ordinary ” When this is high, 
the boy feels himself to be diffeient from most beys, when low, ne 
feels himself to be an average or typical boy 

Superiority and Inferiority constitute two separate scores When 
“Self” IS rated neaier to the “Ideal” than “Ordinary” is, one 
point IS credited to Supeiioiity. When “Ordinaiy” is laled ncarei 
the “Ideal” than “Self” is, one point is credited to Infeuonty. 
These two scores should be inteipreted in conjunction with one 
another. 

Deviation from the Accepted Idea of the Right is found by 
counting the total number of items in which 20 per cent or less 
of the group have marked “Ideal” as in the same manner as the 
individual being scored This requires determining the frequency 
of the distubution of replies to “Ideal” for the group taking the 
test and marking on a fresh sheet those replies made by 20 per 
cent or less of the group High scores on this factor indicate that 
the individual diverges fiom the ideal of the group in which 
he IS tested, low scores indicate that he approaches the common 
ideal. 

Social Insight is measured by counting the number of items in 
which 20 per cent or less of the group answer “Self” as the indi- 
vidual himself marks “Ordinary” and subtracting the result from 
51 To obtain the score, a frequency distribution of the “Self” 
scores made by the group is necessary. “High Social Insight” 
scores indicate that'the boy believes that boys feel very much as 
they actually do say they feel, low scores indicate that the indi- 
vidual whose paper is being scored misjudges by his answers to 
“Ordinary” how other boys feel as indicated by their answers to 
“Self” 

These scores are highly reliable, as is shown in the following 

s 
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table of average reliability coefficients from four groups with from 
50 to 136 boys m each group. 


Table 4 


Reliability CoLmciLNTs irom Person yl Attitudes Test 
(Sele-Ordinary Ideal Rating) 


(from Sweet, 100, p 48) 


bjjf-criticism 

Average 

rehabdity 

coefficient 

914 

Ciiticism of others . 

939 

Feeling of diffcicnce 

93S 

Supcrnrity , ^ 

936 

Inferiority 

78s 

Deviation from accepted idea of right 

86 

Social insight 

87 


Certain of these scores, particularly the criticism of self, criti- 
cism of others, and a combination of superiority-inferiority scores, 
showed promise of being diagnostic of the picsence of peisonality 
and behavior problems when checked against psychiatric case 
studies and the ratings of social workers. 

Comparison of Rating Methods 

Various factois should be considered in determining which 
method of rating is best Objectivity and icliability play an impor- 
tant part Cqiitenience, ease in handling, comprehensibility of 
mylmd, and freedom from vaiiation are all matters to be con- 
sidered 

In general it may be claimed that the different methods are 
equally objective Objectivity in rating depends on the method 
by which the rater makes his observations, and draws his infer- 
ences and on the cleverness with which the positions on the scale 
are defined rather than on the way in whrh the judgments are 
recorded Baiiett (7) assures us that the oider-of-merit me thod 
and the method of paired comparisons are equally good. The 
present writer (loi) has determined that rating and ranking are 
equally reliable under ordinary conditions in school. The choice 
of method in recording one’s judgments depends on other factors 
than reliability. 

One advantage claimed for ranking, paired compajisons, and the 
graphic method over the method of assigning definite scale values 
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IS that discriminations may be made as fine as are desired, at least 
up to a certain degiee However, it is possible to push the mat- 
ter of fineness of discrimination too far One iiho attempts to 
rank thirty 01 forty individuals in some tiait finds that, after rank- 
ing four or five outstanding individuals who can readily be ranked, 
the rest of the cases seem much alike, and accurate rankings be- 
come very difficult to assign Sometimes ranks are the result of 
foiccd discriminations w'hich aie not actually felt by those who 
hate been required to make them Foi this reason rajiking be- 
comes irksome when the number of cases to be ranked is large 
Rating, particularly on a giaphic scale, is more pleasant 

Some consider the method of ranking easier to explain and 
easier to understand than the method of rating Ranking usually 
gives a false notion concerning tlie form of the distribution of the 
tiait being rated, because ranking yields a rectangular distribu- 
tion, whereas seldom if ever do traits for any group assume a 
rectangular distribution The most likely assumption we can make 
about any distiibution is that it approximates the noimal prob- 
ability curve In ranking, the end cases are too close together and 
the middle cases are too far apait Howevei, by statistical methods 
the ranks may be turned into numbers which yield a distribution 
having the shape of the normal distribution The method for doing 
this is explained on page 91 

Knight * objects to rating on the ground that rating may be 
b iased, 1 e., be either too low or too high — ^usually too high. He 
says, “Judges who are presumably equally competent rate the 
same person as almost wholly honest, half honest, rather dis- 
honest, or an out-and-out cheat This disagreement is the product 
of legitimate variations in knowledge about the person and of 
varying standards of honesty held by the judges themselves The 
way to avoid this pitfall is not to use a rating involving concepts 
of virtues, but a system of ranking persons accoiding to their rela- 
tive merits The question should be not, ‘How honest is he^’ but 
‘Is he more or less honest than this person or that^’ ” 

One teacher might give a class an average rating of 4 on a 
rating scale in honesty and another teacher give the same class 
an average rating of 2, and yet the two sets of ratings might cor- 
relate perfectly. This tendency to r ate to o high or too low may be 

* Knight, F B , “Analysis of Teaching and Teaghers,” Jour of Educational 
Research, lo 224 (Oct, 1924) 
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corrected (see p. 97), so that rating need not be discarded for this 
leason. 

Conklin and Sutherland (24) have pointed out that rating is 
better for getting an immediate affective impiession, such as the 
humoi value of jokes In rating, the values once assigned are not 
easily changed In rankmg, on the other hand, it is easy to shift 
the position of a caid in the pack and thus change the original 
Impiession Using the same aigument, rankings should be better 
for recording mature or pondeied judgment. 


Cardinal Points to Be Observed in Rating 

The number of scale-divisions in the rating scale. The two 
rating scales by Galton and Pearson given as examples contained 
nine and seven scale-divisions respectively. Compaie these with 
the data of the following table, from Boyce (ii), showing the 
number of scale-divisions in fifty-four teacher-rating schemes 
which he examined 

Table S 

Distribution Showing Number of Scale-Divisions in Fiftt-four 
Rating Scales* 

(from Boyce, II, p 2o) 

Number of scale divisions m rating scale Frequency 


2 . 2 

3 II 

4 . . . 24 

5 • 12 

6 . ... . 2 


7 I 

54 

* From Boyce, A C , “Methods for Measuring Teachers’ Efficiency,” 14th 
Yearbook, National Society for the Study of Education (1915), Part II Re- 
printed by permission of the Society 

In this matter there are forces pulling in two directions. On the 
one hand, for accurate measurement we want as fine discrimlna - 
tipn as possible and hence as many scale-divisions as possible. 
One of the advantages claimed for the graphic method is that it 
peimits as fine discrimination as may be desired. In the other 
diiection there are factors of economy m effort. Naturally it takes 
moie effort to sort persons Into five divisions than into three. There 
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is an upper limit to fineness of discrimination and it is possible 
Lo have one’s scale divided too finely for feasibility in piactice 
We do not need a yardstick with inches divided into tenths if 
the scale must be read at a distance of fifty feet Just as we may 
have a scale divided finer than the eye can read, so it is possible 
to devise a rating scale with moie classes than any one can dis- 
criminate The potential power of disci imination with the graphic 
scale may be gieatly ovei estimated, since one’s powers of judg- 
ment will fail long befoie the divisibility of the scale is exhausted 

The present wiiter (102) attempted to determine the optimum 
number of classes for a lating scale He showed that the decisive 
point at issue is the loss of reliability which one will accept due 
to coaiscncss of the scale Setting up the arbitiary standard that 
a loss in reliability will be accepted equivalent to the change from 
a reliability of 91 to one of 90, the author computed that ^yen 
IS th e optimum number of classes for lating human traits. With 
less than seven classes the coaiscncss of the rating led to an_ 
appreciable loss in leliability greater than his aibitrary standard 
oFa llowable loss_ Moie than seven classes yielded such a small 
increment in reliability abo\ e what would be obtained with seven 
classes that the attempt to make the finer discrimination turned 
out not to be worth uhile 

Although seven classes is an average optimum number for a 
rating scale, conditions aic often such that more or fewer classes 
are justified If the tiait is an obscure one such as impulsiveness, 
or if the laters are untrained or take only a model ate interest in 
the task, rating scales with more than three or four classes are 
inexpedient If a scale consists of seveial items which are cumu- 
lated to make the scoie on the scale, it is not necessary to rate 
each Item as accurately as though each stood alone 

On the other hand, if one is rating fairly objective functions 
such as difficulty of words in spelling, or if the raleis are enthusi- 
astic and tiaincd, moie than sc\en aie justified In general, table 
6 oh the next page may serve as a guide in determining the 
number of classes. 

It IS a curious fact that there seems to be a natural reluctance to 
use seven classes in rating. In school marks, for instance, it is 
customary to use a five-letter system. However, if we undertake 
to measure at all, we should measure as ^curately as possible. 
The more accurately we rate, the less injustice we shall do to the 
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Number 


Table 6 

or Classes for Rating Scales Yielding Different Reliabilities 


Obtained reliability 

Number of classes desirable 

9 S 

18 

90 

H 

80 . 

. II 

70 

..9 

60 ... 

. .. 7 

50 

. . 6 

4.0 

. . S 

30 .. 

.... 4 

20 

3 

10 . . 

. . 2 


individuals being rated. Accordingly seven classes should be pre- 
ferred to five classes in rating human traits. 

The number of individuals in each class. Ratings can be 
studied by bearing in mind the approximate numbers of individ- 
uals who as a rule should fall in each class This does not in the 
least bind the rater, but it does serve as a guide and will help to 
make ratings by different examiners comparable Of the various 
inevitable assumptions which must be made in ratings, the most 
probable assumption is that the group is a typical one and that 
consequently the members of the group distribute themselves 
normally tvith respect to the tiait Table 7 on the next page, 
derived from tables of the probability integral, may be used for 
guidance in determining the number of individuals who should 
fall in each class A consistent use of these figures will do much 
toward increasing the value of ratings and making the ratings by 
different observers comparable 

Rating one quality at a time. One of the two methods of rat- 
ing a group IS to rate an individual through all the traits on the 
sheet, and then pass on to the next individual Another way is to 
rate all individuals through on a single trait before passing on to 
the next trait It is gcneially conceded to be preferable to rate one 
trait at a time, because a peculiar phenomenon occurs as one rates 
an individual on several traits. Either through mental inertia on 
the part of the rater or because an individual usually seems to pre- 
sent a sort of uniform level on all sides of his personality to the 
observer, it has been noted that the ratings of an individual tend 
to be monotonously alike. If the individual is high in one trait he is 
likely to be high in all; if low in one, low in all. It is demonstrably 
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Table 7 

Percentage Distributions tor Groups of Different Sizes and for Scales 
Having Different Numbers of Classes 


class of 40 (range S sd ) 


3 classes 

4 classes 

S classes 

6 classes 

7 classes 

20% 

11% 

7% 

S% 

4% 

60 

39 

24 

15 

10 

20 

39 

38 

30 

22 


II 

24 

30 

28 



7 

IS 

22 




S 

10 





4 


CLASS 

OF 185 (range 6 s d ) 


3 classes 

4 classes 

S classes 

6 classes 

7 classes 

16% 

7% 

4% 

2% 

2% 

68 

43 

24 

14 

8 

16 

43 

44 

34 

23 


7 

24 

34 

34 



4 

14 

23 




2 

8 





2 


true that our ratings for any one trait are colored by the general 
impression of the individual which we have formed This tendency 
to uniformity docs not appear where one trait is rated through at 
a time, for in the latter case there is a direct comparison of sev- 
eral individuals with respect to a single trait, with results which 
are distinctly superior to comparing several traits in a single in- 
dividual 

Selecting the qualities to be rated. If one is rating a narrow 
and definite habit or attitude or trait, this problem does not arise. 
If, however, one is rating for the purpose of estimating efficiency 
or the value of individuals for a certain kind of work, or if one is 
rating pupils on their ability to do school work, careful attention 
must be given to the qualities to be rated. Some qualities are quite 
unimportant, at least for the purpose for which the rating is being 
done A clerical worker need not be rated, for instance, on his 
temperament, nor a teacher on thrift, nor a school pupil on rever- 
ence. Some qualities are relevant, others are not It is possible to 
rate “general efficiency,” but there is no evidence to show that this 
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method does not yield results inferior to a more analytical sort of 
rating. Tt seems reasonable to suppose that a summation of the 
ratings of the analyzed components of “general efficiency” should 
be more valuable than a blanket rating Separating the rating into 
separate qualities helps to standardize the lating If the rating is 
on general efficiency, then such factors as a disfigured face or a 
brusque manner may overweigh oi piejudice the estimate If such 
factors as intelligence, industry, skill, and cooperativeness are the 
important factors m determining a man’s value, rating on these 
factors would result in more representative estimates than would 
be obtained if each ratei used his oun judgment as to what fac- 
ials were important. Ratings on seveial factois may not yield 
results which are much moie leliable than rating on a single 
factor, but the former method should help eliminate personal bias. 

The analysis of man’s personality into separate traits is fraught 
with difficulties One never knows how much one is drawing on his 
imagination in making such an analysis He does know whether a 
careful analysis into qualities would yield the same results as an 
experimental determination of the analysis The most approved 
method of choosing the items for a rating scale is to rate the group 
of persons on whom the scale is finally to be used both on general 
efficiency and also on a wide variety of sepaiate qualities Those 
qualities correlating most highly uiih the cuteiion (uhich should 
be an independent measure of general efficiency) would be the 
ones to be used in the rating scale 

Care must be erercised in selecting the items for a rating scale 
that they do not overlap Often apparently different qualities 
turn out to be practically identical. For instance, school attitude, 
effort, and industry are identical to all intents and purposes. More- 
over, one should be careful not to permit obscure designations of 
the qualities to be rated Persons with inadequate psychological 
training often make fantastic analyses of peisonality 

Weighting qualities to be rated. In some of the older work 
with rating, when score cards were more frequently used, more 
attention was paid to weighting the qualities to be rated. Naturally 
every one would agree that in the case of a clerical worker, speed 

* Strictly speaking, the intercorrelations between all traits should be found 
and legression weights computed which will make the best weighted com- 
posite and hence the highest correlation with the criterion, then those yielding 
the highest regression weights for predicting the criterion would be chosen for 
the scale. 
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and diligence and intelligence are moie important than sense of 
humor, taste in dress, or politeness. A pioperly constructed score 
card is a reminder of relative values In determining the weights 
to be given to the items in a score caid, it is customaiy to get com- 
posite judgments. Usually too or 1,000 points are decided upon for 
the score card, whereupon the list of qualities is sent to competent 
and interested persons who are requested to distribute the lOO or 
1,000 points among the items With very slight adjustment, the 
average of the allotments may then be used as the weights. 

Another and more difhcult yet more scientific method of as- 
signing weights IS to rate a number of peisons both by a blanket 
rating of efficiency and by the items on the card Correlations with 
the criterion and intercorielations between the items may be used 
to determine the regression weights to be used as the weights of 
the items in the scale. 

Probably in oidinaiy work where ratings are as unreliable as 
they usually are, the matter of weighting is relatively unimportant, 
though It should be remembered that the matter of weighting can- 
not be Ignored and that in a true sense the items are always 
weighted, since they are weighted equally if nothing else But be- 
cause the method of obtaining regression weights is difficult and 
expensive, we assure ourselves, with probable justification, that 
some more approximate method of determining weights is usually 
quite as satisfactory. 

Number of items in a rating scale. A further issue that must 
be decided is the number of items to include in the scale If one is 
making a score card for the purpose of analyzing school buildings, 
the extent of the analysis is determined partly by the functions of 
the different parts of a building But even in this instance judg- 
ment must be used as to how detailed the rating is to be In gen- 
eral we do not want to make the lating sheet too detailed, with 
the resulting increase in the labor of filling it out The issue re- 
duces Itself to the matter of the gam in reliability and validity that 
accrues from the summation of ratings over and above what would 
be obtained from a blanket rating Although there is no precise 
evidence on this issue, the present writer’s opinion is that the value 
of the analysis is usually achieved quickly, so that in general a rat- 
ing scale need not contain over three to five items. The increased 
accuracy which is gained from more detail^ ratings is probably 
not very great. 
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Items in a rating scale must be observable. In selecting qual'- 
ities to be rated, it is important to recognize that they should be 
qualities on -which the observer has an opportunity to obtain data. 
For instance, an interviewer might be in an excellent position to 
give a rating on appeal ance, but would not be able to form any 
judgment as to cooperation or industry In choosing items for a 
teacher-rating card to be used by a supervisor, it is necessary to 
make sure that the items are such as can be observed during a 
class-room visit. One could easily observe and form a judgment as 
to appearance, voice, or use of English during a supervisory visit, 
but one might obtain no evidence peimittmg an evaluation of the 
giowth of pupils with respect to subject-matter, definiteness and 
clearness of aims, optimism or extra-mural interests It is aston- 
ishing how often rating scales are drawn up asking judgments for 
vv'hich the rater is not m a position to gather evidence 

Definition of qualities to be rated. Particular attention must 
be paid to the definition of the items in the scale On this hinges 
much of the success or failure of ratings in general One of the 
most potent factors causing unreliability of ratings is ambiguity in 
meaning of the items on the scale Every rating is a judgment, 
judgments depend not only on observation, but also on the inter- 
pictation according to certain standards of what is observed In- 
telligence IS a good example of a teim which has an ambiguous 
meaning To one man it will mean “adaptability,” to a variety of 
other men it may signify “abstr.actncss,” or “agility,” or “ability to 
learn,” or “possession of general information,” or “constructive- 
ness,” or “ability to put things together” Character is another 
ambiguous term often used on recommendation forms It may 
mean “moral character,” or “trustworthiness,” or “sexual purity,” 
or “prompt payment of bills,” or “cooperativeness,” or even “gen- 
eral value to society.” School attitude is still another ambiguous 
term Interpreted as anything from “industry” to “docility” In 
every rating scale the items should be defined in some way There 
are several possible ways of doing this. One, perhaps the least sat- 
isfactory, is to give synomyms of the original term. Another is a 
short paragraph amplifying the descriptive title. Another method is 
to ask a question which not only limits the meaning of the term but 
somehow helps the rater to see the problem of rating more clearly. 
Still another method is that of describing a person who possesses 
the trait under consideration. 
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Objective vs. subjective definition. Paterson (79, p. 83) em- 
phasizes the point that the trait should be defined objectively 
rather than subjectively A subjective definition is in terms of the 
man’s personality and traits, an objective definition is in tcims of 
what the man does and particularly what effect the man has on 
the person or things with whom he comes in contact. Paterson 
gives examples of objective and subjective definitions. 

Leadership 

Subjective definition Rate this executive’s force, self-reliance, 
decisiveness, tact, ability to inspire men and to command their 
obedience, loyalty and cooperation 
Objective definition Rate this executive according to the success 
he has shown in developing a loyal and effective oiganization by 
administering justice, inspiring confidence, and winning the co- 
operation of his subordinates. 

Appearance 

Subjective definition Personal attractiveness, cleanliness, neatness, 
dress 

Objective definition; Consider how favorably he impresses his 
men by his physique, bearing, and manner 

Many writers have stressed the point that the definition of 
Items on the rating scale should be objective. Abstract traits stand- 
ing alone may be highly subjective, but when related to a definite 
situation they may acquire objectivity For instance, independent, 
standing alone, might be useless in describing personality, yet 
might be very significant when used in connection with specific 
situations such as scientific research. Hughes (54) pleads for a 
behavioristic definition, by which he means a definition more in 
terms of what a person does and what products or effects result 
from his activity than in mere descriptive terms Paterson (82) 
urges that the rater keep in mind past or present accomplishments 
of the individual being rated. Judgment should be reached, not 
on the basis of hoped-for achievement or possible development 
but on the past record of the candidate as reported or observed. 

Not only the trait itself, but the degrees of the trait should be 
clearly and sharply defined Miner (78), an^arly student of rat- 
ing methods, stated that the difficulty of defining verbally the de- 
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grees of ■> trait was so great that it was better to avoid qualitative 
terms and substitute relative or quantitative tcims, as by a scale 
with units equal to fifths of the gioup This emphasis on a direct 
comparison and grouping of individuals is sound, but this method 
belongs primarily in the hands of the skilled rater Later writeis 
plead for clearer definitions of the degrees of a tiait It has been 
suggested that the words average, fairly, veiy, exceedingly , and 
extremely should be avoided in favor of more descriptive adjec- 
tives Fieyd (32) gives illustrations fastidious, in place of ex- 
tiemely neat, slovenly, in place of very caicless in diess On the 
other hand difficulty of vocabulary should be considered, with 
exclusions of unusual woids even though they are highly descrip- 
tive and meaningful Other things being equal, the more common 
the descriptive adjective used tlie better Probably the best results 
are obtained from making the definitions of the degrees of a trait 
objective. 

Statistical Treatment of Paired Estimates and 
Rankings 

Ratings are usually treated at their face vahu "he steps on a 
rating scale which has been assembled hastily au only app xi- 
mately equal However the errors due to unreliability of ra^mg 
are greater than errors due to carelessness of scaling, so that the 
latter can for practical purposes be disregarded. 

The method of paired estimates needs statistical treatment be- 
fore the seveial individuals judged may be assigned a numerical 
rating As the judgments aie made in the methods of paired 
estimates, each individual is stated to be better or worse than each 
other individual m turn These judgments must then be manipu- 
lated so that each individual acquires a position along a linear 
scale. 

Thorndike (109) was the first to propose a definite technique 
for cpmbinmg order-of-merit judgments. His method essentially 
was first to place the individuals in their order by inspection, then 
to determine the true difference in position between each pair by 
finding the percentage of judges who rated the former individual 
as lower than the latter, and finally computing the amount of 
difference represented by the percentage If individuals were trans- 
posed, le, wiongly placed by the original inspection, they must 
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be shifted, and the calculations concerning each individual af- 
fected by the shift recomputed Thoindike directs (io6, p iqB) : 
“In connection with this amendment it will be useful to compute 
also the differences from the two or thiee next neighbors in the 
new order. For if an individual is rightly placed, he will not only 
be below the one placed above him in the order, but also below 
the two or three next above him A process of trial and adjustment 
will be more economical than a rigid pioccdure in this work ”... 
(109, p. 204): “Perhaps some single system of over weighting the 
next neighbor comparison would be advisable On the whole, how- 
ever, we have in this technique an economical and fairly reliable 
means of combining partial orderings ” It will be noted that the 
method proposed by Thorndike is by no means an exact one. 

After the differences are recorded in percentages, these per- 
centages may be turned into absolute amounts A unit of absolute 
difference is defined as that corresponding to a difference noticed 
by 75 per cent of the judges. If desired, an arbitrary scale may 
be assumed and individuals placed on the scale by adding the 
absolute amounts of difference. 

Table 8 

The Amounts or Difference (v — y ) Corresponding to Given 
Percentages of Judgments That a: > y 
% f = the percentage of judgments that * > y 

A ttriiir t ^ nr 

■ — — A yj 4u ui LUC U1UV1L41CC duuii LiiaL /V r 


%r 

A 

Fe 

%T 

A 

pT 

%r 

A 

Pi' 

%r 

A 

PE~ 

%r 

A 

PE 

SO 

00 

60 

38 

70 

78 

80 

I2S 

90 

I 90 

SI 

04 

61 

41 

71 

82 

81 

I 30 

91 

199 

S2 

0-/ 

62 

4S 

72 

86 

82 

I 36 

92 

208 

S3 

II 

63 

49 

73 

91 

83 

I 41 

93 

2 19 

S4 

IS 

6 ‘ 

S3 

74 

9S 

84 

147 

94 

2 31 

SS 

19 

6S 

S7 

7S 

I 00 

8S 

I S4 

9S 

244 

S6 

22 

66 

61 

76 

I os 

86 

I 60 

96 

260 

S7 

26 

67 

6S 

77 

1 10 

87 

167 

97 

279 

S8 

30 

68 

.69 

78 

I 14 

88 

174 

98 

30s 

S9 

•34 

69 

.74 

79 

I 20 

89 

I 82 

99 

3 4S 


Thurstone (iii) has demonstrated a method whereby items 
judged by paired comparisons may be compared without resort to 
approximations or estimates. This method is given below because 
it is probably the best available way of redyeing paired compari- 
sons. 
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Step I Let there be five variables, a, b, c, d, and e, each of 
which has been compaied with eveiy othei by a nuiiibei of 
judges The following table gives the percentages of judges who 
believe an item in any column is less valuable than the items listed 
at the left. 

Table 9 

Per Cent or Judges Who Believe One Item to Be Less 
Valuvble Tuan Another 



a 

b 

e 

d 

e 

a 

. 

323 

338 

211 

128 

b 

677 


41s 

242 

172 

c 

662 

S8S 


260 

136 

d 

789 

7 S 7 

740 


379 

e 

872 

828 

864 

621 


Sum 

. 3 000 

2493 

2 3 S 7 

I 334 

81S 


That IS, item c (looking in column c) is judged to be less valuable 
than Item b (looking in line b) by 41 5 per cent of the judges 
Likewise item b is judged to be less valuable than item c by 58 5 
pci cent of the judges. 

It will be noted that the sums of the columns of percentages 
indicate that the variables are in rank order. If they are not m 
rank order, they should be placed in rank order in proceeding to 
step 2 

Step 2. Turn these percentages into their equivalent deviations 
in terms of their respective standard errors of observation The 
Kelley-Wood Table of the Normal Probability Integral * is used. 
The percentages are read m columns headed p or q, while the de- 
viations m terms of the respective standard errors of observation 
are read on the same line in colunm x. 

Table 10 

Percentages of Table 9 Transmuted into Corresponding 
Standard Deviation Values 



a 

b 

C 

d 

e 

a .... 
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—1 14 
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d 
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I 40 

I 11 
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—3 50 


( 

* Kelley, T. L.. Stiaisttcal Methods (The Macmillan Qompany, 1923), p 371. 
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Step 3 The deviations are then added algebraically. 

Step 4 The “scale” separation of adjacent columns is computed 
by the following formula 

where Si — Sj = — [Sxnc — 


Si — 82 = scale separation 

n = number of values m a column 
2 xn, = sum of first column of pair 
Sxik — sum of second column of pair 


For instance, in computing the scale separation of a and b one 
obtains 


Si — S2 = [2 82 — I 40] 


= .402 


These values become 


Sa-j, = .402 
Ss-o = .082 

So-d — .83^ 
Sd-e = .472 


Step 5 These may be placed more conveniently in a scale hav- 
ing only positive values by setting the scale value of e arbitrarily 


at .000 and 

adding 

the scale differences between successive 

variables 

a 

Scde values 

1788 


b 

.. 

1 386 


c 

. 

1 304- 


d 

.... 

472 


e . 



. 000 


Thurstone recommends that percentages in step i greater than 
97 or less than 3 be omitted from the table as being too unreliable. 
The corresponding standard errors of observation must then be 
omitted from the table in step 2. This necessitates a slight change 
in the procedure in steps 3 and 4. In step 3 only those values 
will be added for which there are corresponding values in the ad- 
jacent column. This may mean that any one column may have to 
be added twice, in order to provide corresponding scores for the 
columns on its left and on its right. In step 4, n must always be 
the exact number of values added In the column 
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Tadll II 

Table of Standard Deviitions to Be Assigned for Rank Positions 
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Ranks yield a peculiar kind of distribution In ranking, every 
individual is one step away fiom the next peison The distribution 
formed is said to be lect angular and is quite unlike the foim of 
distribution obtained from testing with a scale in equal units. 


Distribution of R\nks 
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When we want to compare ranked data with data obtained 
from a linear scale, as scores on tests, ratings, etc , the ranks must 
be transmuted into equivalent scores The best method is to trans- 
mute the ranks into standard deviation units. Tables have been 
devised for this by Ream (87) and Chu (20), but these tables 
are inaccurate Table ii on page 90 has been computed afiesh 
fiom the Kelley-Wood Tables of the Normal Probability Integral. 

Another method of transmuting rankings into units of amount 
was devised by Hull This method consists fiist of turning each 

rank into a per cent, using the foimula ^ where N is 

the total number ranked, and then estimating the score fiom 
Table 13 on page 92 

The following example shows how the method, including the 
table, is used. 

Given eight pupils ranked in order of industry, to transmute 
these rankings into units of amount on a linear scale • 

Pupil A, who has a rank of i, has a per cent position of 

^ ^ = 6.25 His score from Table 13 is nearest 8.0. 

o 

Table 12 

The Ranking of Eight Pupils Turned into Units of Amount 


Puptl 

Rank 

Per cent position 

Score 

A 

I 

6.25 

80 

B 

2 

187s 

67 

C 

3 

31 *5 

60 

D 

4 

43 7 S 

53 

E 

5 

5625 

47 

F 

6 

687s 

40 

G 

7 

8125 , 

33 

H 

8 

93 7 S 

20 
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T\ble 13 

Table for Transmuting Rvnkikcs into Units of Amount 


(from IIull)^ 


Per cent 

Score 

Percent 

Store 

Per cent 

Score 

09 
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22 32 

6S 

8331 

3 I 

20 

98 

23 88 

64 

8456 

30 
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97 
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8S 75 

29 

4 S 
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27 IS 
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86 89 

28 

61 

9 S 

2886 

6 I 

87 96 
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94 
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60 

88 97 

2 6 

97 
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8994 
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1 18 

92 

34 2 S 
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90 83 

24 
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91 
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91 67 
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1 68 
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2 2 
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It IS often necessary to combine rankings or ratings where no 
judge has ranked or rated all of the individuals. For instance, if 
each member of a school faculty were asked to rate the other 
members, it is obvious that in some cases acquaintance would be 
so slight as to render an accurate judgment impossible. It is the 
usual practice, therefore, to ask one person to rate or rank in order 
merely the ten or a dozen persons he knows best. As a result, the 
latings are incomplete, 1 e , do not include the entire group. 

* Fiom Hull’s Aptitude^Testxng, p 387 Copyright 1928 by Wodd Book Com- 
pany, Publishers, Yonkers-on-Hudson, New York. _ - ^ 
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Garrett (36, p 171) has studied the matter of combining in- 
complete rankings (01 latings) He consideied foui iiietliudb (a) 
turning ranks into standaid deviation units (as explained on p 
91), (b) using a simple average of ranks, (c) turning ranks into 
percentiles (as explained on p 91), and (d) making a compari- 
son according to the method pioposed by Thoindike or Thurstone 
(see pp 86-89) His conclusions are 

“i. The final order of amount obtained from incomplete judg- 
ment lists by the S D , Aveiagc, Peicentilc, or Comparison meth- 
ods tallies very closely with the best ‘standard’ or ‘true’ order 
The moie numeious the partial lists, and the longer the lists (other 
things being equal), the closer the agreement of the final older 
from the partial lists and the final older from complete lists 

“2 The order of amount obtained from scattered and sparse 
data may be veiy maccuiate when judged by the order obtained 
from complete lists. With such data no method gives accurate 
results 

“3 Judged from the standpoints of simplicity and time re- 
quired, either the Average method or the Peicentile method is 
superior to the other methods As far as acuiacy is concerned, 
the Aveiage method is ceitainly as good as the other thiee, if 
not better 

“4 When the variability of the individuals or things rated is 
desiied, either the S D or the Percentile method may be used to 
advantage Of the two, the Percentile method is the more simple 
as It avoids the use of negative quantities.” 

In combining two sets of ratings the two factors of impoitance 
and variability must be considered and allowed for If ratings have 
been turned into standard deviation units or percentiles, the vari- 
ability of different rating scales may be taken as equal and no 
allowance need be made for this factor when combining. 

Reliability of Ratings 

The reliability of ratings has been found to be variable and 
disappointingly low, indeed, it has been found by various woikers 
to be so low as to cast grave doubt on the value of rating as a 
method for gathering trustworthy data. 

The reliability of personal ratings is usually measured by the 
correlation between ratings by two comparable judges * 

•Shen (94, p 232) has pointed out that this procedure for determining the 
reliability of ratings really leads nowhere, because it is next to impossible to 
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Miner (78) reports a reliability of .54 when sixty-four college 
seniors were rated by instructors for general ability and .57 when 
thirty-six students were similarly rated Hughes (55) reports an 
average reliability coefficient of 56 when 253 high school students 
were rated on twelve traits and an average reliability of 63 when 
seventy-nine students were rated on twelve traits Shen ( 94 )> ^ 

careful study, reports an average reliability coefficient of 55 for 
thirteen judges (presumably college students) rating themselves 
and each othei on eight tiaits Webb (120) obtained an average 
reliability of 55 when ratings weie made on forty-five traits of 
194 school-boys raied in gioups of twenty (average age twelve), 
and another average reliability of 55 when 140 young men in 
groups of about thirty-five (average age twenty-one) were rated 
on twenty-eight traits Webb admits having rejected fifteen pairs 
of estimates out of 112 for the boys, and sixty-three out of 443 
for the students Using his complete data the average reliabilities 
are boys 49 and students 47 Waite (117) found reliabilities of 
47 and 50 when 1,405 and 2,018 pairs of judgments were ob- 
tained on school-children Hayes and Pateison (43) state in a 
report on the graphic rating scale that correlations between eight 
supervisors were as a rule over 65 Gallup (35) reports the use of 
a giaphic lating scale which yielded a correlation with the educa- 
tional diicctor’s latmgs of 66 m estimating the success of retail 
salespeople 

Kornhausei (69, p 120) reports on ratings of speed, accuracy, 
and general value for a group of specialized office workers “The 
ratings by the three supervisors agreed very closely with one 
another even though they were submitted quite independently. 
The correlations between the ratings by each supervisor and those 
by each of the other supervisors also fall betw^een o 70 and o 90 ” 
It IS unfortunate that Kornhausei did not report more fully the 

obtain two persons who judge with equal reliability or, in other words nho 
are comparable judges “Hence a correlation between two judges is a very 
crude approximation to the reliability of either The reliability of a judge 
thus crudely evaluated often varies considerably according to the judge with 
whom he happens to be correlated” Shen has derived a formula for the elimi- 
nation of this systematic error in the determination of reliability which he calls 
correction for pulling, based on the proposition “the correlation between two 
series of latings on the same trait, independent in errors of each other, is equal 
to the geometric mean of their true reliabilities ” Practically all reliabilities re- 
ported II' psychological literature, however, are obtained by merely correlating 
the ratings of one judge against those of another. 
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conditions under which he obtained such high reliability coeffi- 
cients. 

Furfey (34) reports that analyzing a trait (developmental 
age) into several sub-traits, having the judge rate all these sub- 
traits separately on seventy-five boj^s, and then combining these 
separate ratings into a final score gives the high reliability coeffi- 
cient of 888 

A reliability coefficient of .55 can be said to be typical for 
rating peisonality traits by ordinary judgment methods Some 
tiaits yield highei reliability coefficients, others lower It is easy to 
fall short of even this average figure of 55 if the raters are care- 
less, if the tiaits are loosely defined, if acquaintance with those 
being rated is slight, 01 if there has been inadequate observation 
There is no evidence that either the man-to-man method or the 
giaphic rating scale yields higher leliabilities when only one char- 
acteiistic has been rated However, uhen several traits arc rated 
independently and their results combined, the reliability may be 
raised to much higher figures Indeed, if a trait is analyzed into 
sub-traits which have beeen found by analysis to correlate highly 
with the trait, and ratings on the sub-traits by an individual are 
combined into one rating, the result may have a reliability so much 
higher that it is only slightly under -what might be predicted by 
the Spearman-Brown formula from a single rating On the other 
hand, if the analysis into sub-traits is subjective, the composite 
rating may be no better than the blanket lating 

The conclusion drawn from facts concerning the reliability of 
ratings is that in general the rating by a single judge is too un- 
reliable to be useful Rugg (90) has set up the standard gage that 
human character can be rated accuiately enough for practical pur- 
poses in education only when the rating is the average of three 
independent ratings The number of independent ratings which 
should be obtained may be dcteimined by means of the Speaiman- 
Biown foimula* The accompanying table enables one to know 
how many independent ratings are needed to obtain certain de- 
sired reliabilities. In one column is given the number of ratings 
necessary to obtain a reliability of 82 — an average figure for a 
forty-five-minute standardized test of achievement in high school. 
In the other column is given the number of ratings necessary to 

*Chu (20) and Remmers, Shock, and Kelly (89) give data to show that 
ratings do follow the law expressed in the Spearman^rown formula 
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obtain a reliability of .90 — good for satisfactory individual diag- 
nosis. 

Table 14 . 

Number of Rvtings Necessvry to Obtain a Specified Reliabilitt 
FOR Various Traits 




NUMBER OF RATINGS TO OBTAIN 

A RELIABILITY OF 

8e 

PO 

Average trait 

55 

4 

6 

Scholarship . . ^ 

7J 

2 

4 

Leadership . .... 

68 

3 

5 

Iiiu.llc<.tu<il quickness 

68 

3 

5 

IiUcllcLlUciI profoundness . . .. 

64 

3 

6 

h [cmor/ . . . 

55 

4 

8 

Persistence ... 

40 

7 

14 

Adaptability . . . .... 

•37 

8 

16 

Impulsiveness . 

•34 

9 

18 


As a matter of fact, then, Rugg erred on the side of leniency in 
requiring only three independent ratings. The above table shows 
eight as the average number of independent ratings one should 
obtain if the ratings aie to be individually diagnostic. 

Tendency to Rate Too High 

Unreliability is not the on>y charge to be made against rating. 
The tendency to late too high or to vary the standard of rating 
introduces other serious errors. Two raters may show high agree- 
ment so far as the reliability coefficient indicates, and yet may so 
differ in their level or standard of judgment that their ratings of 
the same person differ considerably This lack of uniformity in 
standards is due very largely to differences in the inteipretation 
of the descriptive adjectives which define the steps of the scale 
Boyce (ii) says, “A teacher who ranks Eicellent in one officer’s 
mind IS found to rank only Good or perhaps Medium in the 
rating of another j'udge who is more critical and less easily 
satisfied ” 

It has been found that ratings tend, on the average, to be too 
high, often so high that the lower end of the scale may not be 
used at all Evidently we have been overgenerous in judging if 
only the upper half of a rating, scale has been used when it 
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was expected that the ratings would be distributed noimally. 
Terman calls this the generosity factoi He believes “that we 
always tend to overrate those we like” and that “when we are 
rating those we dislike, the generosity factor probably operates 
negatively ” 

As a correction for this error, some graphic rating scales have 
been so constructed that most of the descriptive terms represent 
above average m the trait. Instead of 

bad poor average good excellent 

a scale will read 

poor fair good very good excellent 

thereby taking advantage of the frailty of the human mind in judg- 
ing others 

Several suggestions have been made for correcting ratings and 
thereby reducing them to a common basis Paterson (82, p 91) 
advises the following method “The reports made by a given 
supervisor or foreman are assembled and the ‘total scores’ are ar- 
ranged in a frequency distribution from high to low. This distribu- 
tion IS then divided into five parts so that the highest 10 per cent 
of the total scores are given a final letter rating of A, the next 
20 per cent aie given a final letter rating of B, the next 40 per 
cent are given a C rating, the next 20 per cent are given D, and 
the lowest 10 per cent are given E The limiting points in the total 
score are noted and a ‘key to final ratings’ is prepared whereby 
future ratings made by this supeivisoi may readily be conveited 
into final ratings This piocedure converts the actual ratings into 
relative ratings and is designed to do away with the error which 
otherwise arises because some supervisors rate" too high and other 
supervisors rate too low ” This coirection is based on the assump- 
tion that every supervisor is rating a group which distributes itself 
normally in a trait and that every group has the same distribution 
as every other group Of course this is never exactly true, but 
without further knowledge it is the best assumption that can be 
made. Indeed, it is probably a truer assumption m any given case 
than that certain groups are select 

Two other suggestions have been made for eliminating the 
effects of differences in standards. As mentioned before, Knight 
(63) suggests raqking instead of rating, fo^ then individuals are 
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compared with each other, though nothing is said concerning the 
absolute level of the group as a whole, nor that of any individual 
The other suggestion (loi) is to incorporate at the top of 
every graphic rating scale a senes of percentages (as in Rating 
Chart B on page 64) t\hich will suggest the number to go in 
each compartment. This method will reduce but will not eliminate 
differences in standards in rating. 


Factors Influencing the Reliability of Judgments 

There are many factors contiibuting to the unreliability of rat- 
ings An undci standing of these factors may help to lessen or 
eliminate the effect of some of them and may lead workers to ap- 
pieciate the difficulties involved in obtaining reliable ratings In 
the fiist place it seems to be easier to late some people than 
others Secondly, judges differ in their ability to give reliable rat- 
ings In fact, the same judge at different times and under different 
circumstances will differ in the reliability of his judgments. 
Thirdly, the traits or acts being rated differ in the degree to which 
they can be reliably rated. 

Differences m persons being rated. Norsworthy has demon- 
strated that we agree in our judgments about some people better 
than about other people Seven college girls rated ten members 
of a soroiity From the figuies, “it would seem that among ten 
girls who know each othei w^ell there may be twice as much dif- 
feience of opinion about some one member of the group as about 
some other” What factors are involved heie is obscure 

The effect of acquaintance on the icliability of rating has not 
been very thoroughly studied Clecton and Knight (21) found 
that there w'as practically no coiielation between the ratings of 
casual obseivers and close acquaintances on the same individuals. 
On the other hand Landis (72) reports that the reliability of 
ratings does not differ whether made by intimate associates or by 
general acquaintances Shen (93), who has made the most analyti- 
cal study of the effect of acquaintance on ratings reports that the 
average error of ratings is not affected by the degree of acquaint- 
ance. The “casual observers” of Cleeton and Knight were mem- 
bers of an audience who rated men on a stage by appearance 
only on first observation. It seems probable that degree of ac- 
quaintance or friendship after passing a certain threshhold does 
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not affect the reliability of ratings This is not the same as saying 
that obseivation does not increase the reliability of ratings Cady 
finds, for instance, that observation is an important factor in 
increasing reliability 

When we investigate the leliability of self-estimate as compared 
with the ratings of otheis, we find that the results vary. Allport 
(3, p. 129) and H L Hollmgwoith (23) report that there is a 
larger average error in self-estimate than in the estimate of others. 
Cattell (16, p S42) and Shen (93) report that the eiior in self- 
rating is less than in the rating of associates Such divergent 
findings indicate that the diffeience is small, whichever way it 
lies 

Differences in ability to judge. Slawson (96) makes the point 
that two ratings by the same judges are no better than one. This 
is an important finding which should be veiified, for, if true, it 
precludes trying to mciease the reliability of ratings by having 
a judge repeat his owm ratings To increase the reliability of 
ratings, additional ratings must be made by independent ob- 
servers. 

Boyce (ii), Cattell (17, p. 317), Hollmgworth (51), Hughes 
(54), Lindsay (73), and others have noted the fact that indi- 
viduals differ in their ability to rate accurately For instance. If 
several persons have rated a group on some characteristic, and 
their ratings are averaged, this average may be taken as a toler- 
ably accurate measure of the group Of the several judges, some 
will correlate more highly with this average than othei s, indicating 
that some are better judges than others In fact, it is commonly 
assumed that the individual whose judgment is most in harmony 
with the judgment of otheis is the most competent judge Is this 
difference in ability to rate due to a general judicial capacity, to 
general intelligence, or to factors which are more specifically re- 
lated to the particular situation in question^ Hollingwwtli (50, 
51) could detect the opeiation of no geneial j'udicial capacity. 
That is, an individual who is a good j'udge in one situation as 
compared with others may be a poor judge in another situation 
The j'udge who correlates highest in his ratings with the “average” 
ratings on one trait may correlate comparatively low with the 
average rating on some other trait. Snow (97) tells us that busi- 
ness men are not necessarily better judges of the qualities of their 
employees than are other men. 
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Wells (i2i), as a result of his studies of literary merit, makes 
some valuable remaiks about this general judicial capacity He 
says that one is inteiested in ability to judge in a particular field 
rather than in the abstiact power of judgment To speak of 
j'udicial capacity is to refer to a faculty which psychology long 
ago discarded Wells declares that one must have information as 
well as the ability to weigh it That is, what counts is familiarity 
and expertness in the field in which we give our judgments rather 
than a geneial capacity to make judgments. Since part of the 
act of judgment is the noting of fine points and the making of 
distinctions, this can be achieted only through expeiience and 
mteicst in a particular field Vaiious findings by other investi- 
gators tend to thiow light on this matter Remmers and Place 
fSS), and Cady (15) found that teachers agree better in rating 
students than students agree among themseK es It is the teacher’s 
business to know his pupils and to be able to form estimates of 
them In making these ratings teachers were doing something they 
weie more skilled at than the student, even though the students 
knew each other more familiarly 

Arlett and Dowd (6) suggest that even trained judges may 
vary because tiaits function in different degrees m different situa- 
tions The class-room teachei, the physical education director, and 
the laboiatory assistant have contacts with pupils in different 
situations and quite justly may form different opinions of the 
pupils on various traits. Webb (120) suggests that part of th^e 
unreliability of ratings is due to differences in points of view The 
matter of definition of the traits to be rated, discussed m an earlier 
section, has a definite beaiing on reLability of ratings. Again, 
trained judges may vary in their ratings because of personal likes 
and dislikes, and while this variation operates primarily to injure 
the validity of ratings, it also lowcis the reliability of the ratings. 
Landis (72) points out that superficial physical characteristics 
influence judgment of deeper character qualities, with a resulting 
lessening in the value of ratings. 

What are the characteristics of the good judge of others^ This 
practical question has stimulated much thought. Those who do 
well in mental tests are not necessarily better judges of human 
natuic within the limits of variability of the group studied. There 
are however, two exceptions to this rule, as found by Holling- 
worth (23) and his students. Those who do well on mental tests 
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are good judges of intelligence and humor As a generalization, 
we may affirm that, for certain admirable traits, there is a correla- 
tion between possession of a tiait and ability to judge it. The 
more admiiable a trait, the closer the iclation between possession 
of It and ability to judge it Also, one who knows himself best 
also knows others best in such desitable tiaits as neatness, in- 
telligence, humor, refinement, and sociability The reverse of this 
seems to be true in the case of undesirable traits Those who pos- 
sess a high degree of vulgarity, snobbishness, or conceit are dis- 
qualified to judge others on these tiaits To rate desirable traits 
of character successfully, however, one should possess those traits 
himself to a high degice 

Adams (i) after studying extensively the characteristics of the 
good judge of personality, was led to differentiate the character- 
istics of the successful judge of self from the successful juB^^f 
others He found the accuiate judge of self somewhat more tn- 
telligent and more observing than the good judge of others The 
good self-rater tends to be happier, less gloomy, less irritable, less 
liable to lose his head, more sympathetic, more geneious, and 
more courageous than the good judge of others. But the out- 
standing characteristic of the good self-judge is his greater social 
interest and adaptability Adams explains the appaient paradox, 
that the one who is most interested in others undci stands himself 
best and vice versa, as due to the fact that the person interested 
in others is best able to judge his own characteristics most im- 
partially. 

Hollingworth (jo, p 141) in his extensive investigations has 
brought out other interesting relationships A group has been 
found to agree most with respect to their likes, while differing as 
individuals w’lth respect to antipathies For instance, a group which 
agrees in its general liking for certain types of jokes does not agree 
to the same extent in its aversion to types of jokes. On the other 
hand, though an individual shows offense or irritation at certain 
types of jokes rather uniformly, he does not respond so uni- 
formly in the type of joke that does appeal to him. It is question- 
able whether this generalization will apply in other situations. 
Again, in a study of the persuasiveness of advertisements, Hol- 
lingworth (51) found that men agree most in their preferences, 
women in their dislikes Is this a general sex difference^ In 
another study Hollmgwoith leporls that negative judgments are 
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more variable than positive judgments These issues and Hol- 
lingwoith’s conclusions need further study and veiification There 
aie many questions beaiing on ability to rendei accurate judg- 
ments the solution of t\hn.h would throw consideiable light on 
the reliability of latings 

The consistency of latings, in the sense of self-consistency, is 
another phase of ability to rate. By consistency in this sense is 
meant the correlation between a j'udge’s ratings at one time and 
those made at another time Now being a consistent j'udge — ■ 
agreeing with oneself on two different occasions — is not the same 
thing as agreeing with other people oi with the true situation 
Natuially one would expect an individual to agiee more closely 
with his oun judgment than with the judgment of another per- 
son or with the average judgment of many people Hollingworth, 
in studying the judgments of persuasiveness of advertisements, 
found this to be true The average correlation of the first and 
second judgment of an individual was 72, whereas the average 
con elation of judgments with the group average judgment was 
only 48. 

Another related point is whether the consistency of rating for 
an individual is smaller or larger than the variability of judg- 
ments of a group For instance, it is conceivable that in rating 
pictures or music for merit, a judge might be consistent in his 
own likes and dislikes but might vary widely from the likes and 
dislikes of others On the other hand, it is conceivable that ten 
individuals might agiee moie closely m rating a group of appli- 
cants for a position than any one judge would agree in his own 
latmgs on two different occasions Wells (122, p. 547) records the 
following conclusions fiom his own study. 

‘‘We have thus made a study of '-ariability in three classes of 
judgment, fust, the highly subjective feeling of preference foi dif- 
ferent sorts of pictures, second, the moie objective judgment of 
color diffci cnees, and finally a type of judgment whose accuracy 
could be readily measured by objective means. It appeared that 
in the first class the judgments of each individual cluster about 
a mean which is true for that individual only, and which varies 
from that of any other individual more than twice as much as its 
own judgments vary from it, that in the second class, with the 
colois the variability of the successive judgments and those by 
dilTeient individuals markedly approached each other, but still 
prcscivcd a significant difference, while in the third class, with 
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the weights, we found that there might be an excess of the indi- 
vidual variability over the ‘social ’ This comparison seems to 
affoid, to a certain extent, a quantitative criterion of the subjec- 
tive.” 

Can judicial capacity be determined from consistency in rating^ 
Is the person who on a second occasion agrees with his fiist judg- 
ment necessarily a good rater ^ Both Hollingworth (47, 51) and 
Slawson (96) answer no, that there is no substantial relation be- 
tween capacity to rate and consistency of rating The consistent 
judge may really be a poor judge As Slawson says, “This relation 
was investigated with the intention of ascertaining whether it was 
possible to determine capacity from consistency, since the latter can 
be determined so quickly and easily The results were negative.” 

It has been found by Hollingworth and Cady (15) that re- 
liability varies with the degree of confidence in the ratings. If a 
judge in making his ratings is asked to indicate opposite each 
rating the degree of certainty with which he makes the rating, 
the ratings recorded with confidence have been found to be the 
more reliable High reliability of ratings may be obtained by 
selecting only those ratings of which the judge feels sure, — a 
fact of importance in using ratings in experimental woik 

Related to this last point is the discovery made by Cady (15) 
that certain ratings are apt to be e.\tieme ratings Wc are usually 
more sure of our judgment when we rate a person high or low 
than when we give him an intermediate position Indeed, inter- 
mediate ratmgs are frequently given simply because the rater 
knows little one way or the other about the person 
" Finally it has been verified by Cattell, Hollingworth (47), 
Snow (97), Cady (15), Cox (26), Wells (122), and others that 
extreme ratings are more reliable than intermediate ratings Cox 
(26) found, for instance, that the higher the IQ of a person, the 
more reliably that IQ could be estimated Cattell’s (16) study of 
eminent scientists revealed that there was mdeh more agreement 
as to the rank of the first few scientists than for scientists who 
came far down the list. It would seem that in proportion as a 
person becomes outstanding in any trait or quality, so much the 
finer are the discriminations that are made concerning him, prob- 
ably because differences in ability or quality between individuals 
are greater at the extremes. Genius is usiwlly subjected to ruth- 
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less scrutiny, simply because it is outstanding, whereas the average 
individual passes by unnoticed It was discovered both by Hol- 
lingtvoith (47 ch R') and in the Califoinia study of gifted 
children (103) that theic is regression in the evaluation of ex- 
treme cases, the tendency being to underiate the superior indi- 
viduals and to overrate the inferior individuals 

Differences in the traits being rated. First it should be noted 
that Yoakum and IManson (123) report that ratings on synony- 
mous traits coi relate v^ell. 

Several investigators have established the point that it is more 
difficult to rate ceitain traits than others Slawson (96), for in- 
stance, finds for ten traits the following average reliability co- 
efficients. 

Table is 

Reli\bility CocFnciENTs or Ratings ron Ten Traits 
(from Slawson) 


All-round value to service 

. . 603 

Coopcrativ encss 

. 522 

Leadership 

. S03 

Effort 

491 

Understanding of children 

. . . 472 

Professional inteicst and growth 

470 

Appearance ... 

. 460 

Tact 

4 S 3 

Punctuality . . . 

408 

Judicial sense 

335 


Cattell (16) found the relative agreement of judgments when 
twelve scientific men estimated the chaiacter of five of their col- 
leagues Norsworthy (8i), using the same list of traits, had nine 
members of a college sorority judged by five of their intimate 
acquaintances. Hollingworth (49, p. 79) has reduced their results 
to comparative figures which represent the amount of disagree- 
ment among judges in estimating traits of character, the average 
disagi cement being taken as 100 (see Table 16). 

Hollingworth (23, p. 173) in one of his investigations obtained 
data which shed light on the relative reliability with which traits 
are rated. In a class experiment twenty-five students rated them- 
selves and each other. Table 17 on the following page, for which 
ranking was the method used, shows the average deviation of 
judgment of twenty-four acquaintances on nine traits. The aver- 
age deviation for a pu'cly chance arrangement of items is 6. 
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Table i6 

Amount or Disacellmuji i amual Jldlls in Estimating, on the Basis of 
Acquaint iKCE, the Trufs of Otiiirs. in Tuo Investigations 
(from H L Hollingvvorth, Judging Ilvman Cliaiacter, p 79) 


RELATIVE divergence OF JUDGES 


Trait 

Cattell 
(iz judges) 

Norsworthy 
is judges) 

Average 
of both 

Efficiency ... 

■ . • • 7 S 

92 

83 

Originality 

• 95 

77 

86 

Perseverance 

• • 75 

lOI 

88 

Quickness 

go 

88 

89 

Judgment 

. 100 

78 

89 

Clearness .... 

.. 104. 

75 

90 

Energy 

■ • 75 

109 

91 

Will 

.. .. 85 

98 

91 

Mental balance 

.. . no 

81 

96 

Breadth 


92 

96 

Leadership . , 

.... 90 

103 

96 

Intensity . . 

... . 85 

113 

99 

Reasonableness . . 

. ... 115 

86 

100 

Independence 

. 104 

98 

lOI 

Refinement 

.... 90 

1 16 

103 

Physical health . . 

... IIS 

92 

103 

Emotions . . . 

.... 120 

91 

los 

Courage 

. . .100 

119 

109 

Unselfishness . 

. . IIS 

106 

no 

Integrity 

.. .. 104 

130 

117 

Cooperativeness . . 

125 

113 

119 

Cheerfulness .... 

.... 130 

II 2 

121 

Kindliness 

. . . . 120 

125 

123 


Classifi- 

cation 


Class A 
Median 89 

Close 

Agreement 


Class B 
Median 100 

Fair 

Agreement 


Class C 
Median 118 

Poor 

Agreement 


Table 17 

Average Deviation of Traits in Ranking 


(from Hollmgworth) 


Average 

deviation 

mrank 

Neatness 45 steps 

Intelligence .... 37 

Humor 45 

Conceit 41 

Beauty 38 

Vulgarity 3 5 

Snobbishness 48 

Refinement 5 9 

Sociability 47 


«... 44 steps 


Average 
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In Minei’s study (78, p 127), previously referred to, leliability 
coefficients aie gueii foi sev'cial tiaits. The figures quoted here 
are for tuo judgments agai.'st two others, rating fiom thirty to 
thirty-six individuals 

Table 18 

Reliability Coeiticients of Ratings for Six Traits 


(from Mmer) 

Rehabtltty 

coeficieHts 

Enerpy . 

78 

Le.idership 

77 

General abilitv 

76 

Reliability .... 

71 

Initiative . . ■ 

67 

Common sense 

62 


Shen (94) in his study presents reliability coefficients for thir- 
teen judges’ ratings on eight traits for twenty-eight individuals 
As averaged by the writer, they are as follows. 

Table 19 

Reliability Coefficients of Ratings for Eight Traits 


(from Shen) 

Reliability 

coefficients 

Scholarship 

. 71 

Leadership 

68 

Inrellcctual quickness . . 

68 

Intellectual profoundness 

64 

Memory 

55 

Persistence 

40 

Adaptability 

38 

Impulsiveness . 

24. 


Enough eviden ce has been given to show t^hat c ertain tra its may 
be ra^ed more reliably tTTan others. The reliability with which 
a trait may'”be'"rate(rrs depen'dentTnlarge part on the objectivity 
of the trait, and vice versa, the objectivity of a trait may be 
estimated by the degree of reliability with which it can be rated. 
Tne more objective the trait, the less two people will disagree 
in making judgments on it In general, traits which somehow leave 
their mark on things or influence external events are more re- 
liably rated than qualities which are merely characteristics of the 
person being judged. Hollingworth (49, p. 80) differentiates be- 
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tween the two extremes as follows* “The A traits we may desig- 
nate as ‘objective’ in the sense ihal they represent icactions to 
objects and impersonal situations and tasks, and aie likely to 
lesult in objective products such as inventions, factoiies, books, 
bank accounts, salaries, positions, records, etc These objective 
pioducts are definite manifestations of the traits m question, and 
they aie open to geneial inspection The C traits, on the other 
hand, represent reactions to the presence and character of other 
persons. They are peisonal. social, moial, they do not so definitely 
produce objective pioducts open to geneial inspection Instead, 
they lead mainly to peisonal and emotional reactions on the part 
of others, hence we may designate them ‘subjective’ traits” Cat- 
tell has noted that it is easiest to rate people’s reactions to ob- 
jective things and hardest to rate reactions to other people. 
Slawson (96) in his work found that the “uniformity, explana- 
tions, and accessibility of ciiteria” for defining a tiait were the 
most important factors in deteimining the objectivity of the trait 
Since different traits differ so markedly in the degree to which 
they may be reliably judged, it behooves one who is planning a 
rating card to consider the items to be included from this stand- 
point Such Items as cooperativeness, cheerfulness, kindliness, re- 
finement, adaptability, and impulsiveness should be avoided in 
favor of more objective and better-defined traits 

Hollingworth (47, p 112) discovered that the longer the series, 
the less accuiately any item in the senes was placed in rank by 
a series of judges. Confusion is apt to result if the number of 
individuals to be ranked becomes too laige It is never well to re- 
quire persons to rank more than a dozen or fifteen peisons on 
any trait 

Rugg (90) and also Slawson (96) have pointed out that a gen- 
eral all-round trait is moie reliably lated than a moie specific trait. 
This IS probably caused by the so-called halo effect Ratings on a 
narrow trait require moie detailed obseivation than is usually 
made. At any rate, as was mentioned on page 95, the combi- 
nation of ratings of specific traits that correlate highly with a 
general trait is considerably more reliable than the blanket rating 
of a general trait. In this connection Hanna shows that the cor- 
relation between ratings of teachers of the same subject is higher 
than the ratings of teachers of different subjects for the same 
trait. That is, as the observation of situaticms becomes narrow and 
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more uniform, judges will agiee in their judgments Cady (15), 
ioi instance, finds that observation increases the reliability of 
ratings The long and thoiough obseivation which should precede 
latings, discussed eailier in this chapter, is one of the surest means 
of increasing rating reliability. 


Validity of Ratings 

In a certain sense there is nothing more valid than a judgment 
In the final analysis, all of our knowledge has its origin in ob- 
servation and in interpretations made of observations However, 
we have just seen that human judgment is liable to error, and 
that variable or compensatory errors affect the accuracy of the 
rating without impugning its genuineness or honesty In the pres- 
ent section we are to consider the question as^ to whetlmr ratings, 
really are measures of w'hat they set out to measum We shall 
find that latmg? are also subjecTto certain constant errors and 
that apart from the inaccuracies called accidental or chance, there 
are other sciious errors that we may call systematic We shall 
find that human judgment is not strictly impartial, but has a 
tendency to eri through partiality and favoritism. 

The acquaintance factor. It is strange that acquaintance should 
sometimes be a factor in decreasing the validity of ratings 
Throughout this chapter the need for observation and more ade- 
quate information has been stressed There is perhaps no fajitox 
more impoitant in helping to reduce the variable errors of rat- 
ings than familial ity Hence we may say m one sense that ac- 
quaintance makes a judge more competent We all feel that a 
teacher is a better judge of the members of her class than of 
any other class in the school and that eveiy parent is a better 
judge of his own children than of his neighbor’s children But 
with acquaintance thcie creeps in an insidious tendency to become 
lenient or to show favoritism, or at least to be off one’s guard. 
Knight (64) first emphasized the systematic error that results 
from long acquaintance. He found that there was a distinct ten- 
dency for supervisors to rate the teachers highest whom they 
had known longest, where impartial testimony would not credit 
these older teachers with greater teaching ability. It would seem 
that with long acquaintance there goes an unconscious tendency 
to excuse, to explain aWay defects or to overlook deficiencies that 
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would be consideicd in an individual less well known Shen (93) 
finds that the same tenduicy to otenaLc acquaintances holds 
among college classmates This subdc factoi, vhich voiks un- 
consciously, must be lecognized and allowed for in using ratings 
m personnel work In the rating of teachers by supervisors or the 
rating of employees by managers this factor is certain to slip in 
to decrease the value of ratings. 

Self-rating. Another factor w'hich influences the validity of 
ratings is brought to light in studies of self-rating Cattell (16, 
p 542) avers that there is no constant error in judging ourselves; 
and, if w'e consider all kinds of persons and all kinds of traits, 
we may agree that in a certain sense this is so Although later 
investigators have noticed a definite tendency to overrate one- 
self, It can be pointed out that then studies dealt almost entirely 
with desirable traits Finally Hollingworth (23) has discovered 
that there is a tendency to oven ate the self on desirable traits and 
to underrate the self on undcstiable tiaits Hurlock (56) found 
that of 12,690 responses made by 423 children in selecting traits 
to describe themselves, only 6 per cent were ones relating to 
socially undesirable traits The eiror in rating was always in 
favor of the person doing the rating as compaied with what was 
obtained in his estimates of othci persons 

We do not begin to know all we would like to know' about self- 
rating Many factors influence the results, and several of the 
studies are contradictory Shen (95) finds that intellectual pro- 
foundness and memory aie underestimated to a greater extent 
than impulsiveness, although the former traits would hardly be 
called less desirable than impulsiveness Shen suggests that a de- 
fense mechanism is at play here, and that instead of overrating 
there is a tendency to excuse the self Self-iating may therefore 
be a key to the diagnosis of types of maladjustment wdiich are 
familiar both in schools and in industry Shen also finds that this 
tendency to overestimate or underestimate the self is more or less 
consistent with the individual, depending more upon the indi- 
vidual than upon the trait This is added evidence that aberra- 
tions of self-estimate may be symptomatic of maladjustments of 
the personality. 

Trow and Pu (114) believe that the tendency to overrate the 
self on desirable traits is a national characteristic. They find that 
thirteen to sixteen out of eighteen Chinese»college students whom 
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they studied tended to underrate themselves on desirable 
traits 

A single experiment by Hollingwoilh (52) bungs out two other 
phases of self-rating which he calls the factois of optivusvi and 
altruism A number of persons weie set to work by Hollingw orth 
at the continuous pcifoimance of a series of mental and physical 
tests After each trial the peifoimcr was requiied to judge whether 
he had done bettei or woisc than usual on this occasion. In each 
case another person was lequired to watch the performer, and to 
judge, in the capacity of witness, whelhei the perfoimance had 
been better or woise than usual for the individual who was doing 
the work Hollingworth found that a person has a tendency to 
judge himself as doing better than usual, thereby exhibiting a 
kind of optimism Witnesses, hovtever, showed an even greater 
inclination to judge a performance as better than typical, thus 
etidencmg not only optimism but altruism. 

This tendency to favor the self spreads to one’s friends or to 
rnembeis of one’s group Kinder (58), for instance, TuTds that 
there is a tendency to overrate members of the same sex as com- 
pared with membcis of the other sex Hart and Olander (38) tell 
us that men are more lement m their ratings than are women 
Remmeis and Place (S8j find that students late each other higher 
on the average than teachers latc them Cattell (16, p 542) finds 
in his study of eminent scientists that a professor is apt to over- 
estimate the standing of his colleagues There is a tendency for 
parents to overestimate the attainments of their own children 
There seems to be a general tendency to overrate not only our- 
selves on desirable traits but also those with whom we are asso- 
ciated or in whom we have an mtcicst 

So important does this matter of self-iating loom that Yoakum 
and Manson believe that the relative desiiabihty of traits may 
be determined from amount of over estimation in self-ratings 

Another phase of self-rating was brought out by Hoffman (46), 
who discovered that a person possessing a desirable trait to high 
degree usually underestimates his possession of it, whereas a 
person deficient in a desirable trait overestimates his possession 
of It to an even greater extent For instance, a pupil standing 
high in his class in school will modestly place himself somewhat 
lower m standing, whereas a low-standing pupil will rate himself 
higher. Allport (3) in corroboration tells us that the intelligent 
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underrate themselves less than the unintelligent overrate them- 
selves Even though paicnts tend to oveiesiimatc then childien’s 
attainments, parents of gifted chddicn usually undcieslimate their 
childicn’s gifts, Tciman (103) finds, since thev place a lower 
estimate on the children’s abilities than do their tcacheis Hoffman 
also finds that persons thought conceited by others are the more 
apt to overestimate themselves on desiiable tiaits 

Halo effect. Many investigators have noted the tendency for 
geneial impressions to spiead to specific traits, to which tendency 
Thorndike (107) has given the name “halo effect” Wells, in his 
Statistical Study of Liteiaiy Meat (121, p 21), was the first to 
mention this phenomenon He says, “Theie is a possibility of 
one rather disturbing constant erior in measuies of this nature, 
whose extent it is net er possible to know accuiately There is noted 
introspectively a tendency to grade for geneial merit at the same 
time as for the qualities, and to allow an individual’s general 
oosition to influence his position m the qualities This would be 
the case especially in the case of those qualities that were ill- 
defined in the minds of the subjects, and tended to be interpreted 
rather in terms of general merit Wc might thus have a grading of 
charm by general meiit instead of general merit by charm ” 
Webb (120) notes the same influence “Let us suppose, for in- 
stance,” he says, “that the obseiveis, in estimating the intelli- 
gence qualities, are biased in the diiection of maiking subjects 
who possess other desirable qualities too highly and vice versa ” 
The halo effect was noted to be a prominent factoi in connec- 
tion with the work with the Aimy Rating Scale Thoindike (107), 
the first to interpiet these findings during the Woild War, says, 
“The magnitude of the constant eiror of the halo, as we have 
called It, also seems suiprisingly laige, though we lack objective 
ciiteria by which to determine its exact size ” 

Rugg has described in detail the findings in connection ivith the 
Army Rating Scale. He says (90, p 37), “We judge oui fellows 
in terms of a general mental attitude toward them, and there 
IS dominating this mental attitude toward the personality as a 
whole, a like mental attitude toward particular qualities.” Rugg 
tells of a certain “Captain X” who was so well known and 
conspicuous in his group that he was used by thirteen officers in 
twenty different subordinate scales — physical qualities, intelligence, 
leadership, etc — as “scale” man on the .^Trmy Rating Scale as 
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“the pooicst man I ever knew” Yet this same Captain X stood 
fiist on three different psychological tests among 151 officeis 
He had been a Rhodes scholar fiom a Middle Western State 
university, and at Oxfoid he had made such a record that he 
was excused from ceitain examinations Comments of eight of 
the thirteen olEceis tvho had judged him so severely showed that 
their estimates of his intelligence, his physical qualities, and his 
leadership were dominated by their opinions of his personal 
qualities They were unanimous in saying that it was impossible 
“to live with him ” He was a ‘hotter, ” or “yellow,” o. a “knocker,” 
or “conceited ” “The suggestion comes insistently, however, that 
one of the most potent influences woiking against accurate es- 
timates of character is the pievalence of just such general atti- 
tudes toward our associates and suboidmates ” 

Knight (65) also has found that the halo of general impression 
IS a considerable factor working to lowei the validity of ratings. 
In his work in the field of estimating teaching success, Knight 
showed that suspiciously high coiieiations prevailed between the 
ratings of different tiaits. The same phenomenon was disclosed 
in an earlier study by Boyce (ii). Knight concludes, “This de- 
cided nionotony of the size of the correlations, which are ob- 
viously too high, IS potent witness of the presence of spread of 
general estimate . . . Their monotonous similaiity also suggests 
that, when analyzed judgments aie attempted, the influence of 
general estimate is so stiong that the resulting analyses are per- 
haps even more justifications of the gencial estimate than they 
are judgments of the specific trait.” 

HollmgMorth (23) warns that this spread of general estimate 
should not be confused uith general “stand-outishness ” Certain 
individuals do stand out prominently in many respects Correla- 
tion IS the general law o*" relationship between desiiable traits, 
so that an iiiduidual possessing a desirable characteristic will 
be found to possess other desiiable chaiactcristics also. 

The present writer (loi) attempted to estimate the size of the 
halo effect by having two teachers rate a group of forty pupils 
on seven traits. A composite rating for each child, found by add- 
ing together the ratings of the seven traits by one rater, was 
taken as the general impression rating Partial correlations were 
computed, holding constant the general impression of each of the 
raters. The resulting' partial correlation between any two traits 
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was, as It were, freed from the halo effect or the influence of 
general impression The average coefficients of rcliabihty as found 
were 390 and .377 The average partial coefficients of lehability 
were .145 and 200 The differences, 245 and .177, seive as 
a measure of the influences of the general impression. The study 
assigns five possible reasons for a large halo effect in the rating 
of any trait or habit (i) the trait or habit is one which is not 
easily observed, (2) the trait or habit is one which is not com- 
monly observed or thought about, such as one which is not usually 
emphasized m the classroom, (3) the trait or habit is not clearly 
defined, (4) the trait 01 habit is one which involves reactions with 
other people rather than “self-contained” behavior, (3) the trait 
or habit is one with high moral importance in its usual connota- 
tion Knight believes that acquaintance increases the halo effect 
Hartshorne and May report extensively on the con elation be- 
tween their performance tests and latings They found (39) 
that measures of class-ioom honesty correlated with a rating for 
general honesty around 40 With improved rating techniques 
they found (40) a correlation of 61 (corrected for attenuation) 
between total score for service on performance tests and total 
reputation for service by ratings The coi relation of the leputations 
with separate service tests was much lower The corresponding 
correlation between tests and ratings of self-control was .52 They 
demonstrate (41) that although the average correlation between 
actual judgments and objective tests is only 35, it is possible 
by taking the sum of latings obtained in different ways and the 
sum of tests on various characteristics to obtain much higher 
correlations — i e., between 50 and 60 With a wider variety of 
tests, several sets of ratings, and the assumption of perfectly accu- 
rate measurement, they demonstrate that the con elation would 
approach unity In short, though actual ratings agree only indif- 
ferently with tests. It is theoretically possible by extending both 
ratings and testing to cover a wider vaiiety of situations until 
the agreement would become as close as the accuracy of judgment 
and measurement permits. 


Conclusion 

Enough IS now known about ratings so that skilful and proper 
use may be made of this method of measurdhient Time was when 
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ratings ne’e indiscnmmate’i' useJ nith no thought of safeguards 
o; prer .urionarv lochnlCJUu^ l'iif:)"tun^te'v this holds tiue to-dav 
among lUinccs Directlv at,e. ''Voiid ^^al a leaction set in, 

due to cxpeiiencc nr.h tl.i Jx, -ug t'calc and also paiticu- 

Lih- to Rugg’s admissions concerning his experiences with rating 
,1'- se* foitli in his se'ie^ of articles entitled ‘Is the Rating of 
Ki Cl, n Chaiactei Practicable^' But experimentation continued, 
,1.1 J .ehnements tvcie made until to-day ratings are accepted, 
thc.g.i somcuhat gmgerlv, as a ’*alid means of obtaining data 
Bl'l tuc tva experience taught us to be alert against the unre- 
lialu. 'V of the single offhand laung 

Ratings are capable of great impiovement if the following fac- 
tors, among otheis aie considered and if enough pains are taken 
to ensure that the latings are reliable 

I Ratings should be made m a s ystema tic way. 

3. An extended period of obsenation should precede rating 

3 It should be kept m mind that rating is something In which 
the rater may mijioye through practice just as he grows more 
skilful in judging the qualm' of handwriting or an English com- 
position through practice 

4 IMore attention should be paid to defining the qualities or 
traits to be rate_d and more cxtensiye definitions should be intro- 
duced In place of the man-to-man rating scheme, a definite and 
extended description of the scale men might proye effective (a 
suggestion borrowed in pair fiom experience with the English com- 
position scale technique). 

5 Single latings should not be used in the rating of human 
qualities Sufficient leliabilit}’ mav be obtained only when a com- 
posite IS made of the independent judgment of from five to ten 
observers 

6 Foi expel imcntal purposes all ratings should be discarded 
except those which arc at the extreme ends of the rating scale and 
those on which the raters are sure of their judgment. 

7. Traits for rating should be selected which experience shows 
yield better than average reliability. 

8 So far a s possi ble bias should be eliminated from, ra ting s 
Individuals should not be expected to give fair ratings when judg- 
ing themselves, friends, old acquaintances, or persons whom they 
much like 01 dislike, _^dmiic or despise. 
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Chapter IV 
THE QUESTIONNAIRE 

J UST as the test is the instrument which one uses to test 
ability, so the questionnaire is the instrument best fitted tc 
measure conduct Although tests and questionnaires are some- 
times confused, the distinction between them should be sharply 
drawn. All research instruments are loosely called “tests” in re- 
cent experimental literature Although both tests and question- 
naires ask questions, and both expect answers which are true or 
are in accord with the facts, there is enough difference in their 
function to permit a differentiation in terminology In taking a 
test one tries to give the correct answer to the questions In taking 
a test, one is awaie that he is being tested and bends his energies 
accordingly, on the other hand he does not take a questionnaire — 
he answers it In answering a questionnaire the issue is not 
whether a person can answer the questions, but whether he will 
answer the questions tiuthfully In a test we look to the difficulty 
of the questions and arc inteiested in the speed with which they 
are answered In a properly constructed test a person’s score is 
limited by his ability, and he is unable to exceed certain scores no 
matter how haid he tries In a questionnaiie we eliminate diffi- 
culty by making the questions as easy and simple as possible and 
give the person as much time as he wishes to answer We place the 
emphasis on truthfulness of response In answ^ering a question- 
naire one may alter his answers at will, allowing them to portray 
one or another picture of the situation to suit a particular 
purpose. Tests, in short, are designed to find out what a person 
can do, while questionnaires arc designed to find out what a 
person has done or will do, or how he thinks or feels or believes. 
Questionnaires are instruments of research of great value in the 

’This latter statement is not unreservedly true As a matter of fact most 
pi rsons prob. bly Mould not be able to go against habit sufficiently to make 
verj' extensive alterations to their answers to a questionnaire The truthful 
person finds it very difficult to be untruthful. 
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investigation of conduct Rugg (32) has classified them Into three 
gioups (a) those asking for facts which the repoitei has ob- 
seived, (b) those asking for facts to be found in records, and 
(c) those asking foi reactions of the individual, such as beliefs, 
pieferences, likes and dislikes, wishes, judgments, and choices. 
Examples of the first type of questions are “What is your name^” 
“What IS your sex^” “What is youi age^” “Where were you 
boin^” “What is your occupation^” “What is your salaiy'” In 
this fiist type all the questions iclate to the peisonal history of 
the person reporting on the facts, and he draws the answers di- 
lectly fiom experience In the second type, the questions cannot 
be answered fiom memoiy or experience, and usually one must 
go to records to find the answers In this gioup Rugg places ques- 
tions regarding the age-grade distribution of pupils in a school; 
distribution of teacheis’ time among various subjects, statements 
from payrolls or class enrolment lecoids, etc In the third type, 
questions aie asked concerning a poison’s likes, dislikes, beliefs, 
wishes, tastes, interests, and pieferences The fiist two types are 
obj'ective in the sense that the questions pertain to facts which may 
be verified by others Witnesses may be called in to identify a 
man’s name, or the records may be resorted to for verification 
as to age The last group aie subjective in the sense that the 
answers refer to inner states of the individual which only he is 
able to examine and observe, and which he alone is in a position 
to divulge The accuracy of answers to questions in the first two 
types of questionnaire is generally taken for granted even though 
the answers may not in every case be exactly true. However, the 
psychology of testimony has icvealed to us the fact that a man is 
often unable to observe coriectly and often is impelled by motives 
which cause him to answer incoriectly, and much of the dis- 
cussion in what follows endeavois to give diiections for the con- 
struction and use of the questionnaire so as to minimize the factors 
which lead to untruthful answers 

The third type has been shunned for many years because it 
was believed that the answers to the questions were so subj'ective 
as to be of little value. The renaissance of this type of question- 
naire proceeds on an entirely new set of assumptions regarding 
its use. It Is recognized that the answers are subjective and may 
possess little basis in fact But the high reliability which these 
questionnaires possess show that they do measure something quite 
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consistently Consequently, while answers to individual questions 
aie recognized as having little significance, the answers to gioups 
of questions may show impoitant trends Coi relations aie used to 
determine what these trends are In using this type of question- 
naire, tliough one makes no assumptions concerning the truthful- 
ness of the answers, it has been found that certain answers point 
to conduct trends in the individual As an example, it has been 
shown that high school seniors who state that they do not like 
to play pokei are more studious than students who say they do 
like to play pol.ei The answers to this question, how'ever, should 
not be taken as an index of the extent of pokci-playmg in 'a par- 
ticular school even though, as occuned in the case in point, thiity- 
eight out of 12 1 answ'eis wxrc m the affirmative * There may have 
been a tendency for the other eighty-three boys to attempt to 
cieate a favorable impiession. In any case, theie was a tendency 
for the thirty-eight who said they liked to play poker to be less 
studious than the forty-eight who said they did not like to play 
pokci Whatevei the truth of the answers, this tendency tow aid 
non-studiousness on the part of those who admit an interest in 
poker is important 

It IS possible for the answer to a question and the interpretation 
placed upon the answer to be contradictory For instance in the 
Character Education Inquiry, a questionnaire was used called 
C E I Attitudes S f Examples of some of the items in this ques- 
tionnaire aie (i6, p 98) 

5 Do you always preserve order when the teacher is out of the 
room^ 

13 Do you usually pick up bioken glass in the si-reet? 

29 Do you lead the Bible every day? 

It was found that childien who gave yes answers to those ques- 
tions and answered the other thiity-three questions in similar man- 
,,^ncr tended to cheat more in class-ioom situations, in athletic con- 
tests, and with money than children who answered them in the 
reverse manner. Consequently these tests are considered good 
tesrs of lying The correlations, not the apparent value of the 
questions, tell the story. 

' S\ monds, P M , “A Studiousness Questionnaire,” Journal of Educational 
Psy(li'loi;y, 19 152-167 (192S) 

tH lit .home, H, and May, M A, Studies in Deceit (The Macmillan Com- 
p.tny, 1928), pp 9811 By permission of the Macmillan Company, publishers. 
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One should always know whether a questionnaire is being used 
to obtain facts 01 to study individuals Questioiiiiaiieb of the fiisL 
two types are used to obtain facts, and the answ ers are to be 
taken at their face value In questionnaiies of the third type the 
purpose IS not so much to obtain facts as to study individuals. 
There may be the wish to investigate the study situation of a 
school by means of a questionnaire To find the facts regarding 
study, one may ask such questions as “How long do you study 
outside of school each day^” “Do you take notes during your 
reading^” “Do you review the lesson of the previous day?” To 
find out how studious an individual is, the writer found the fol- 
lowing questions most significant “Do you like to day-dream^” 
“Would you like to own a revolver^” “Would you caie to become 
an aviator'” The answeis to these need not be taken at their face 
value as facts, but taken in combination they do show important 
trends in an individual 


Questionnaires Designed to Elicit Facts 

Experience in the use of questionnaires as instruments of in- 
vestigation has resulted in certain rules and ceitain warnings which 
by helping to avoid pitfalls result in more satisfactory returns 

In the first place the questionnaire should be used only in the 
pursuit of new or original inquiries Answering a questionnaire is 
a demand upon a person’s time and attention One must be careful 
not to misuse questionnaires with any group of people so that 
resentment or hostility to the method is aroused. There are two 
exceptions to the general rule that questionnaires be used only in 
a new or original investigation. One is when there is suspicion 
that the results of a previous similar investigation are umeliable, 
the other when after a lapse of time conditions have so changed 
that a repetition of the inquiiy could shed valuable light on move- 
ments or trends 

[ A second requirement is that the questionnaire should be used 
only in an attempt to elicit vital information Too often this valu- 
able instrument has been used in picayunish or trivial inquiries 
Here one must rely on judgment as to the value of the informa- 
tion sought. One criterion of this value is the sincerity with which 
it IS sought. In recent years the questionnaire has been resorted 
to by students in pursuit of graduate degrees»when the only excuse 
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for the inquiry was the hope that it would contiibute to the disser- 
tation A strict ceiisoibliip or or questionnaiics used for such ex- 
traneous pui poses would be w'elcomcd by then victims 

In general an inqunj should be pursued only when the results 
are of value not only to the persons to wdiom the leport is ad- 
dressed but also to those who help furnish the information The 
returns on a questionnaire aie always moie complete if those to 
w'hom It IS addiessed are made to feel its value They should be 
told wheie the final report will appeal, and provided the investi- 
gator IS in a position to cany out his pledge, those who answer 
should be piomiscd a pcisonal repoit on the outcome of the 
investigation Since no inquiiy should contribute merely to the 
selfish personal interests of the inr cstigatoi, it goes without saying 
that piofessional advice should nc\er be sought in a questionnaire. 

Before embarking on the active piooccution of an inquiry, the 
investigator should familiarize himself thoroughly wnth all previous 
work that has been done on the pioblcm A systematic search 
should be made for previous studies that have been made in the 
same field, and these studies should be carefully read and the 
findings organized There is a technique of bibliographical re- 
search (which cannot be desciibed here) which should be followed 
to make sure that previous woik on the pioblem has been utilized. 
An investigator should be familiar W'lth the known and the un- 
known, and also with the pitfalls and difficulties waiting for him 
in his chosen field The knozvn should be separately analyzed and 
organized and should find no part m the questionnaire unless for 
purposes of verification. 

In this connection it cannot be too strongly emphasized that 
information asked for on a questionnaire should not be available 
elsewhere Only too often the questionnaire is a lazy man’s method 
of obtaining facts or answers w'hich might be looked up in already 
existent printed reports Rugg (32) gives as an example the fol- 
lowing question included in a questionnaire addressed to teachers' 
“State the population of the village, town, city, or district in 
which you teach.” Besides the objection that answers given to such 
a question are unreliable, the investigator himself had access to 
this information in the census reports. 

Furthermore, questions should not be asked which require ex- 
tensive inquiry on the part of those to whom they are addressed. 
For instance it is hardly fair to address a questionnaire to a 
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school principal for information concerning his teachers — their 
training, experience, conditions of woik, etc — which would oblige 
him to canvass his teachers for these facts, when the teachers 
might be addiessed directly by the mquirci Likewise an investi' 
gator has no right to ask for an extensive compilation of records 
in an administrative office If he needs these compilations, he 
should provide for the necessary labor. In certain cases, however, 
questionnaires may justifiably be sent out which do ask for 
compilation on the pait of those answering The United States 
Office of Education, for instance, has asked for compilations in 
Its work of gathering school statistics Frequently large coopera- 
tive studies canvass the members of an association who are 
severally willing to assemble such data as constitute their share 
in a larger program in which all are interested 

One should always consider whether the poison addressed has 
the infoi matron sought for It is a common eiror to address ques- 
tionnaires to persons who are no more competent to answer them 
than IS the investigator Questions put to children should be es- 
pecially scrutinized m this respect For instance, in some of the 
questionnaires designed to give a measure of socio-economic level, 
questions are asked which no child should be expected to answer. 
“How many books in your home libiaiy?” is certain to lead to 
wild estimates on the part of most children Even if a little help 
IS given by telling how many books usually go in a foot of space 
on the library shelves, most children will have as much difficulty 
estimating the number of feet of library shelves as the number of 
the books themselves 

Defining the problem. A survey of the literatuie dealing with 
previous work on a problem is usually most revealing What 
originally appeared to be a single and modest problem will take 
on many aspects. Questions ciop up on all sides At the earliest 
opportunity the problem at hand must be delimited, issues must 
be sharply drawn, and those that are pertinent to the main prob- 
lem must be separated from those that are to be considered col- 
lateral There is every reason for laying careful plans for every 
step of the work at the very outset. Just as the architect plans 
every detail of the construction of a house on paper before a sod 
IS turned, so the investigator ought to foresee all issues that are 
likely to appear in his work and plan deliberately either to neglect 
them or to meet them adequately. 
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A novice is apt to underestimate the labor of drawing up, send- 
ing out, and tabulating the returns on a questionnaire Rugg states 
that the scope of the inquir}'^ must be planned from the be- 
ginning In certain inquiries a complete canvass is desired, in 
which case the decisions at the outset of the inquiry particularly 
concern the number and character of questions to be asked. How- 
ever, most investigations need not aim at a complete census or 
in\entoiy. If a picture or summary of conditions is wanted, the 
same results can be obtained by sampling, provided care is taken 
to ensure representativeness by taking into account all factors 
which might influence the results Do you wish to survey practices 
in the teaching of English^ Then be careful to recognize such 
factors as size of school, section of the country, and types of 
courses, and plan the investigation to cover all the variations 
which may occur because of such conditioning factors “Further- 
more, the delimitations of the extent of the study call for careful 
weighing of the relative value of having a small number of ques- 
tions and a large number of replies, or of having a large number 
of questions with a small number of replies.” (32) All of these 
matters must be decided before active work on the investigation 
is started. 

Administration, of the questionnaire. It is decidedly a good 
plan in preparing a questionnaire study to try it out in a pre- 
liminary form The scrutiny of prior studies shows, perhaps, the 
questions which one wants to ask, but there are many problems 
connected with formulating the questions and estimating the kinds 
of answers which will be received which only a preliminary try-out 
can decide If it is not possible to try out a preliminary draft on 
members of the same group to whom the perfected schedule will 
be sent, try it out on students or colleagues, or even on friends or 
members of the family You will find that some of the questions 
are ambiguous or are easily misinterpreted, and that others are 
not answered as you intended. In the light of these deficiencies 
and the criticisms of your friends, revise your blank for its final 
form. 

When possible, mail all schedules in duplicate If the inquiry 
is of value to those to whom it is addressed, they will appreciate 
a duplicate set of their answers, or at any rate a set of the ques- 
tions, for filing away. 

A space should be left at the end of the questionnaire for the 
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signature of the respondent. But because he may carelessly fail 
to sign his name, the sender should type in on the question blank 
the name and address of the one to whom it is sent, thus guaran- 
teeing that these essential data will be on the blank when it is 
returned. 

A self-addressed, stamped envelope should be included with the 
questionnaire as a courtesy. 

Buckingham urges that returns on a questionnaire be acknowl- 
edged, an additional courtesy well worth extending and one which 
perhaps can be satisfactorily accomplished by means of a postal 
card. 

It must be recognized that 100 per cent returns from a ques- 
tionnaire are never obtained, even in the case of questionnaires 
sent out by government bureaus which have a maximum of pres- 
tige and authority Where conditions are not so favorable, the 
returns may be no higher than 30, 40, or 50 per cent Factors 
which tend to raise the percentage of returns are the authority or 
prominence of the sender, the attractiveness of the question blank, 
the pertinency of the topic to the interests of those who are to 
answer it, and the various psychological appeals adopted to stimu- 
late an answer Unanswered quer'ions blanks are not always 
thrown in the waste-basket since oftentimes they are put aside till 
a more convenient time for answering and thereupon become lost 
in papers on the desk Carelessness rather than lack of interest 
may account for many failures to make returns It should be re- 
membered that m most inquiiies the choice of those to whom 
schedules are sent is a delicate issue When a considerable number 
fail to be returned, the representativeness of the sampling may be 
seriously deranged The investigator must always ask himself if 
there is a probability that those who failed to answer would have 
given the same answers as those who did If those who do not 
answer fail to do so because of lack of interest or because of 
Ignorance, there is strong probability that their returns, if they 
could be obtained, would alter the results. Usually, when returns 
are received from only a small percentage of those to whom ques- 
tion blanks are sent, some estimate must be made which allows 
for this factor of selection. 

Toops (36) suggests the use of follow-up letters to obtain a 
higher percentage of returns. Lindsay (22) found that a follow-up 
card and letter were of greater value than the original letter and 
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questionnaire The usual experience is to find that there are 
diminishing returns from successive follow-up letters Toops found 
that It took SIX follow-up letters, each using a different type of 
appeal, to get 100 per cent replies m a certain inquiry, where the 
returns to the original questionnaiie were 52.7 per cent He found 
that those best known to the sendei were the first to reply, in- 
dicating that acquaintance is one factor in producing returns It is 
wise in outlining an investigation to plan to include two or three 
follow-up cards or Ictteis, since these have a demonstrated value. 

Preparation of the question blank. Every questionnaire should 
have accompanying it a lettei stating the puiposes of the inquiry 
and enlisting cooperation Although m some cases a statement per- 
forming the same function may introduce the questions in the 
blank, the use of the letter is probably preferable The whole 
success of the investigation largely stands or falls on the strength 
of this original appeal Some letters aie sentimental, some banter, 
some cajole, some threaten, some are enthusiastic, some are 
mattcr-of-fact. The present wiiter piefers one that is straight- 
forward and truthful Let it show enthusiasm by all means, but, 
more important, it should plainly and succinctly state the problem 
and Its importance, both in general and to the person answering 
Poffenberger (29) suggests that the preliminary letter be made to 
play upon deep-rooted motives and desires One such device which 
may seem coaise, but uhich is undeniably effective, is to appeal 
to the person who is being questioned as one of a select group 
whose judgment is especially respected If a questionnaire is ad- 
dressed to “a leading teacher of experience” or to “one of the 
more thoughtful parents” or to one whose “expeiience and judg- 
ment IS especially valued,” this is playing on a motive that few 
are unresponsive to Because the appeal of authority is also power- 
ful (the census, backed by the authority of the law, has power to 
require an answer), questionnaires sent out by institutions or as- 
sociations are almost certain to command greater respect than 
those sent out by individuals. A student will always find that his 
questionnaire receives a larger percentage of answers if his letter 
includes a statement that his inquiry is sponsored by his in- 
structor, especially if the instructor has a wide reputation We all 
feel more inclined to answer questions asked by those whom we 
know. 

The design of the question blank is most important. Its size is 
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usually determined by exigencies of punting or mimeographing, 
and ot mailing But convenience in answeimg and in tabulating 
and peihaps in filing aie also factois A single sheet of questions 
is much to be pieferied to two or moie pages if the questions 
can be crowded into that compass without making the blank diffi- 
cult to use In the matter of arrangement, probably the most 
serious error is that not enough room is left for the answer. 
By all means do not cramp the spacing of the questions where 
an opinion or evaluation is asked which may take several lines. 
Plan to have answeis come at the right-hand side of the page 
and if possible in a column The plan of blocking off questions 


I 

2 

3 

4 

5 

etc 


irregulaily over the page as shown in the illustration is not to 
be recommended because questions are often overlooked and the 
tabulator finds that his difficulties are increased 

Some persons may prefer to answer a questionnaire on the type- 
writer, and the space for answeis should be so planned as to make 
this possible without cramping the 100m for written answers. 

Chaddock (9) warns that paper which will take ink should be 
used for the schedule Another good suggestion made by Chad- 
dock IS to number 01 letter each item so that the tabulator may 
readily identify it. 

Rugg (32) advises that the complete tabulation forms be pre- 
pared before di awing up the questionnaire If this is done, the 
peitmency of each question becomes apparent, and useless ques- 
tions may be avoided This pielimmaiy step will also ensure that 
the investigator words his questions so as to anticipate the form 
of the answer that can be used best. Often it happens that one 
wishes too late he had worded his question so that the answer 
could be tabulated in numerical terms. 

He who plans to use the Hollerith tabulating machine or other 
mechanical methods in tabulating the returns to the question- 
naire should confer with the operator of the»machine before draw- 
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ing up his question schedule From a carefully planned question- 
naire, with questions specially punted and numbered in a certain 
way, an operator can transfer the data to the Hollerith cards 
much more economically than from the usual form of question- 
naire blank. 

Finally, the investigator must get his question blank printed or 
typed Since the attention which a questionnaire receives is di- 
rectly affected by the attractiveness of the form in which it is put 
out, he should spare neithei care nor expense in piepanng the 
blank. Whenevci possible have it printed, as a printed sheet is in 
the cleanest and most atti active foim Multigraphing is better 
than mimeogiaphing If the mimeographing process is employed, 
use a good quality of papei, and keep the copy clean by cutting 
fresh stencils when necessary 

Preparation of the questions. The following rules have been 
found helpful in preparing questions for the schedule of an investi- 
gation 

I. The number of questions should be small. It is a temptation 
to ask many questions As one plans an inquiry, its scope tends 
to enlarge, and there is a tendency to allow the original problem 
to spread into by-paths One argues that since he is canvassing 
this particular group of people, he may as well ask this or that 
question, interesting in itself, though not strictly pertinent to the 
inquiry. The mass of questions lolls up like a snowball This 
tendency towaid expansion must be combated by rigorous censor- 
ship. Xo fixed standard can be set for determining what is the 
smallest possible number, because each inquiry has its own con- 
ditions, but a maximum of ten questions is probably safer than 
one of forty 

2 Questions should be brief Questions should be worded in the 
fewest possible words Probably the questions stated most briefly 
also excels in claiity and directness Poffenberger ^29) asked 
seventy-five persons to state a question to find out how much 
people usually pay for their socks (or stockings). Seventy-five 
different questions were framed — no two were alike They ranged 
all the way from “What do your socks cost you per pair?” to “We 
aie trying to get information concerning the average price paid 
for hosiery. Will you please tell us what you pay for your hosiery, 
specifying the kind of hose you wear, socks or stockings, and the 
make?” One has only lo imagine the form m which the answers to 
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the longer question will come to see how difficult they will be to 
use. 

j Questions should cover information desired Frequently a 
question is so worded that it does not ask what is intended In a 
questionnaire to determine health practices in a school, such ques- 
tions as “How much milk should you drink a day?” or “Do you 
usually sleep over eight hours a day^” are quite inadequate to get 
the information desired Instead of asking a person what he does, 
the first question asks for a judgment as to what should be done. 
In the other case the question will merely call foith yes and no 
answers without making it possible to determine the average 
number of hours or the distiibution of hours spent in sleep. 

4 Questions should be simple enough to be unde) stood A ques- 
tionnaire IS not an intelligence test In fact, it defeats its own 
purpose if It IS not so understandable that the most unintelligent 
person to whom it is addressed can answer the questions One 
way in which this point can be tested is to make sure that the 
vocabulary of the questions is not difficult, perhaps by checking 
the words used to see that they come well down on the Thorndike 
Word List Substitute a simple word for a more complex word 
where possible Avoid technical words where a common word will 
do as well If technical words must be used, which might be un- 
familiar to any member of the group addressed, they should be 
defined. 

5 Questions should be unambiguous. It is startling to discover 
how difficult It is to make a question which will not be misinter- 
preted, or better cannot be misinterpreted. In a questionnaire 
on study it was asked “What kinds of notes do you take^” 
Such a question may have the greatest variety of answeis, for 
there are so many different categories of notes — long or short, 
running or outlined, etc — that the replies would be difficult to 
summarize. 

6. Questions should be specific, not general. The general ques- 
tion is the greatest offender in questionnaires. One finds at every 
turn questions which ask for opinions, but which, failing to define 
the form the answer should take, yield useless answers. Rugg (32) 
gives examples of such questions: 

Do you have difficulty in obtaining clerical help? 

What in general is the attitude of the parents toward “home 
work” in school studies? 
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What differences m training do you notice between public high 
school commercial graduates and graduates of the common busi- 
ness college? 

By means of questions of this type one can justify the place of 
almost any subj'ect in the school curriculum Recently the Classi- 
cal Investigation canvassed Latin teachers as to the validity of 
certain objectives for the Latin course in high school. Of the 
Latin teacheis canvassed, 93 per cent stated that “the develop- 
ment of certain desirable habits and ideals which are subject to 
spread, such as habits of sustained attention, orderly procedure, 
overcoming obstacles, peiseverance, ideals of achievement, ac- 
curacy, and thoroughness, and the cultivation of certain general 
attitudes such as dissatisfaction with failure or with partial suc- 
cess,” was a valid objective for the high school course. This 
answer suggests the testimonials that are often given to dis- 
charged employees While the good faith of such an answer need 
not be suspected, we are left wondeiing to what degree teachers 
believe this ideal is accomplished 

Questions like, “Is the candidate well educated^” or “Do you 
consider Mr. Blank an experienced teacher?” are practically 
worthless. 

7 Questions should be stated in acceptable language. Double 
negatives or alternatives in the question should be avoided. The 
following question, cited by Poffenberger, is an example of a badly 
worded question. “When buying face powder, do you or do you 
not usually consider whether or not it is of a very fine texture^” 

8 Questions should be so aiianged that the answers can be 
made by checking Before inserting a question in the blank, one 
should anticipate the possible answers If the answer is to be yes 
or no, It IS well to put in parentheses after the question, or in 
small print beneath the space foi the answer (“write yes or no”). 
In some cases “yes no” is printed in the blank so that the answer 
chosen may be indicated by underlining. This is apt to lead to 
confusion as some people will cross out the answer they do not 

■■ consider correct, making it difficult to tell whether underlining or 
ciossing out was intended. 

In case the answer is not a mere alternative such as yes or no, 
all possible answers should be anticipated and listed in the ques- 
tionnaire, the answer to be indicated by checking the alternative 
selected. 



The Questionnaire 135 

Example ' 

Why did he leave schooP 

Moved away. 

Poor quality of school work. 

Poor health. 

Must work to help support family. 

Lack of interest. 

Parents did not see value in educa- 
tion. 

Low ability. 

In this example the correct alternatives should be checked 
Often the investigator will not be able to think of all possible 
alternatives. His preliminary use of the questionnaire on a small 
group can then be employed in getting typical lesponses which 
may be inserted for checking m the mam inquiiy Even with this 
method, not all possible answers will be listed, and blank spaces 
should be left for additonal answers. It is a fact, however, that 
such additional blanks are seldom utilized. It would seem as 
though, for most people, the suggestions made by the possible 
answers listed in the questionnaiie block the recall of other an- 
swers. 

In case the answer to a question is a series of numbers, the 
question blank should provide a tabulation blank wheic the an- 
swers may be conveniently listed. Rugg (32, pp. 54, 55) gives the 
illustration shown on page 136 

This general plan of anticipating possible answeis and listing 
them for checking, or of pioviding foims in which answeis may be 
conveniently written, helps both to make the questions easier to 
answer and to make the answers easier to tabulate Occasionally 
one deliberately asks a question when fieedom in answering 
should be allowed. There are times when the investigator wants a 
free expression of opinion, such as a statement as to the success 
or value of an undertaking, or when he is looking for a variety of 
suggestions, perhaps for future work. For answers to such ques- 
tions, one usually needs to allow a half or even a full page for the 
answer. 

g. Avoid leading questions. Questions should be so worded that 
they do not suggest an answer, or suggest«>one answer rather than 
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Omitting all names, will you give the individual yearly salaries that were paid 
junior high school principals, teichers, supervisors of special subjects, and 
principal’s clerk during the school year 1914-1915^ 

To make it easier for you, the salaries are arranged in groups in one column 
In the opposite column (marked number receiving'), will you place the number 
VI ho received the salary stated' 

Example — If four women teachers and one man teacher receive annual salaries 
between ^8oo and $825 respectively, enter them thus 


Annual salaries 

Men 

JFomen 

$800-825 

I 

4 


Principals’ 

salaries 

NUMBTR RrCLlMNG 

E veil S VI VRV GIVEN 

Teachers’ 

salaries 

NUMBER RECEIVING 

EACH SALARY GIVEN 

Men 

IV omen 

Men 

Women 

$1000-1099 



$ 500- 549 



HOO-II99 



SSO- S99 



1200-1299 



600- 649 



1300-1399 



650- 699 



1400-1499 



700- 749 



1500-1599 



750- 799 



1600-1699 



800- 849 



1700-1799 



850- 899 



1800-1899 



900- 949 



1900-1999 



9SO- 999 



2000-2099 



1000-1049 



2100-2199 



1050-1099 



2200-2299 



IIOO-II49 



2300-2399 



I I 50-1 199 



2400-2499 



1200-1249 



2500-2599 



1250-1299 



2600-2699 



1300-1349 



2700-2799 



1350-1399 



2800-2899 



1400-1449 



2900-2999 



1450-1499 



3000-3099 







another Munsteiberg long ago gave an apt illustration (cited by 
Poffenberger, 29, p 139) of the leading question: 

“When a clerk in a store says to a customer. ‘Will you take the 
package with you?’ an affirmative reply is anticipated and fre- 
quently received To the question ‘Will you have the package 
sent^’ an affirmative reply is again anticipated and frequently 
received.” * 

* Poflenberger, A T, Psychology of Advertising (A W Shaw Company, 
192s) By pcrmission_of thj pre^t publishers, McGraw-Hill Book Company 

















The Questionnaire 137 

So we see that such a simple thing as the form of questions 
asked by clerks may mean dollars and cents to a firm. 

Muscio studied the influence of the foim of a question on 
suggestibility He presented moving pictuics as material for ob- 
servation and asked a number of questions stated in eight differ- 
ent ways. The suggestiveness of a question was measured by de- 
termining the percentage of times a subject followed the lead of 
the question. The questions in order of suggestiveness are: 

Tvble 20 

Percentage of Times a Subject Followed the Lead op a Question 
(from Muscio) 

% suggestiveness 


1 Didn’t you see a ^ 91 7 

2 Did you see a ^ 89 2 

3 Didn’t you sec the * 840 

4 Was the (k) iit or 11 ^ 77 S 

5 Did you see the * 626 

6 Wasn’t there a > 518 

7 Was there a . ? 43 6 

8 Was the (k) m . . > 39 7 


The negative form of question is always more productive of 
suggestibility than the positive, and a question which implies a 
personal relation of the observer in seeing is more productive of 
suggestion than a mere question of fact The definite article, the, 
aroused suggestibility less than the indefinite article a. 

10. Ask questions that will be answered It is common sense to 
assume that incriminating questions uill not be answered, or that 
if they are answered, there is no assurance of the tiuthfulness of 
the answers Incriminating questions legarding health, religion, 
economic status, and habits are frequently asked in such blanks 
as the application for a mairiage license, driver’s (automobile) 
license, teaching position, etc Questions concerning habits in re- 
spect to use of intoxicating Iiqiiois were evaded even before the 
prohibition amendment. College students hesitate to incriminate 
themselves by stating the amount of time they spend in study or 
the amount of help they receive from others Questions concern- 
ing all matters considered private, such as one’s income, or matters 
which are the subject of moral scrutiny, and social taboo questions 

* Muscio, B , “The Influence of the Form of a Question," British Journal 
ol Psychology, 8 351-389 (Sept, 1916). , 
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are likely to go unanswered If they are answered, the answers 
cannot be trusted 

A diffeient type of question that will not be answered or should 
not be, IS one concerning which the peison questioned has no 
information Questions that relate to incidents or details wholly 
outside one’s range of interest and attention will not receive an- 
swers Try yourself to see if you can recall the number of pickets 
in a certain fence, or the number of steps leading up to a familiar 
building, 01 the number of pillais m the portal of some church. 
Although you have passed these buildings daily, you may never 
have chanced to make these paiticulai observations. Yet many 
of the questionnaire items icqune pieciscly similar detailed ob- 
servation 

Another type of question which will not in the nature of things 
receive a coricct answer, and perhaps no answer at all, is one 
which delves back far into the subject’s past experience This par- 
ticular type of question is now receiving much attention because 
of the claims of the psychoanalysts that they have thereby dis- 
covered m such questions the clues to much pathological conduct 
But our knowledge of the learning piocess makes it clear that 
most of our established ways of acting are not the product of a 
single incident but rather of a long series of similar incidents. 
Learning is not cataclysmic, it requires many repetitions of the 
same act, and hence an inquiry into the past to determine when 
one first did so-and-so is almost certain to result in inaccurate 
answers. 

Better than to ask for percentages In a question is to get the 
data from which the percentages may be computed. Every one 
cannot be trusted for accuracy in computation, and there is less 
liability of error in interpretation if one asks for the raw data 
Rugg warns us not to forget to ask for the total number of items 
m case a percentage is desired For example, an investigator who 
wished to know the proportion of brothers and sisters who lived 
to adulthood, asked. “How many brothers and sisters lived to 
early adulthood or longer?” and omitted to ask for the total num- 
ber of brothers and sisters. This particular question is also faulty 
in that adulthood is not defined and hence will not be interpreted 
in the same way by every one. 



The Questionnaire 


139 


The Psychological Questionnaire 

The third type of questionnaire which asks for reactions of the 
individual we have termed the psychological questionnaire. 
Whereas the first two types ask foi questions of fact, the puipose 
of the psychological questionnaire is to study the individual by 
asking questions relative to his beliefs, wishes, likes, interests, 
tastes, preferences, choices, feelings, fears, and worries. 

These questions may be divided into two groups according to 
the way in which it is intended that they be answered On the 
one hand instructions may be given that the issue is to be care- 
fully and. deliberately weighed and the answer given only after 
reaching a thoughtful decision. On the other hand one may in- 
stinct a person to give his immediate impression, or choice, or 
leaning, thus aiming to make the answers less a result of careful 
ratiocination and more of those sets or readinesses that detcrm.ine 
our immediate responses to vaiious stimuli This latter is the 
type of response that usually is desired in psychological ques- 
tionnaires Many questions are asked, and they are to be answered 
in a short time Though some have spoken of this as being an 
emotional response, perhaps in contiast with a thoughtful re- 
sponse, It seems doubtful whether the emotions play more than a 
minor role, if indeed they play any, in detei mining the answer. 
The emotions, taken strictly by themselves, are not diiective The 
response which is wanted in these questionnaires is the resultant of 
accumulated past expeiiences, rather than that of immediate 
ratiocination. 

Persons interested in the measurement of conduct have not 
been slow to adopt the testing devices previously found useful 
for testing judgment in the school subjects Goodwin Watson* 
describes various adaptations of these tests for measuiing attitude. 
These new-type questions aim at objectivity by suggesting possi- 
ble answers from among which the pupil is to select by checking 
the answer which he thinks is most satisfactory. Various classi- 
fications of these questions may be made. Here they will be merely 
listed in order and the reader, by noting likenesses and differences, 
may build up his own classification. 

•Watson, G B, Experimentation and Measurement in Religious Education 
(The Association Press, 1927) * 
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j. True-false. In this type a statement is given which may be 
either true or false, and the answer is indicated by writing T to 
indicate ‘‘true” or F to indicate “false,” (or + or — ) opposite 
the statement * 

Example 

I brush my teeth daily True False 

I bathe daily True False 

I 2 Yes-no test This test, composed of questions to be an- 
swered b)’ either yes or no, is already in frequent use in psy- 
chological questionnaiies It is similar to the true-false test, with 
the statement in this form turned into a question 

j Goodwin Watson makes considerable use of true-false 
or yes-no test items which permit a third alternative answer in- 
dicating that the statement is neither true nor false or is to be 
answered by neither a categorical yes nor a no This considerably 
broadens the scope of statements or questions which may be used 
in the questionnaire, and more discrimination is needed to decide 
between three altei natives than between two 

L 4. The multiple-choice item has had extensive use in achieve- 
ent testing, but less is knowm conceining its value for use m the 
questionnaire The multiple-choice item is a statement which 
ends with several alternatives. 

Examples 

I bathe daily, several times a w’eek, every week, less frequently 
than once a week. 

I brush my teeth after every meal, after breakfast and before 
going to bed, once a day before going to bed, infrequently 

* Manuals, in Tillich detailed directions for constructing true-false and other 
types of objective tests arc given ha\c been prepared by the following 

Hopkins, L T , The Construction and Use oj Objective Examinations (1926), 
University of Colorado Bulletin, 119 pp 

Odell, C W, Traditional Examinations and New-Type Tests (The Century 
Co, 1928) 

Paterson, D G, Preparation and Use of New-Type Examinations (World 
Book Company, 1925) 

Ruch, G M, The Improvement of the Written Examination (Scott, Fores- 
man and Company, 1924) 

Ruch, G M, The Objective or New-Type Examination (Scott, Foresman and 
Company, 1929) 

Russell, C , Classroom Tests (Ginn and Company, 1926) 

S> inonds, P M , Measurement in Secondary Education (The Macmillan Com- 
pany, 1927). ' **Op cit. - - . 
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Taken as matters of judgment, the distinction between the 
questionnaire and the rating scale disappears The answer to this 
kind of a questionnaire is a form of lating, and if one answers 
the questions concerning himself, it becomes self-iating The 
question frequently aiises as to who should be the person to an- 
swer a questionnaire on personal matters — the person himself or 
one of his friends The factors which help decide this issue are 
discussed under self-rating in Chapter III The questionnaire may 
even take the form of a rating scale Instead of having the ques- 
tions answered yes or no, lelative values may be indicated as in 
rating The graphic rating scale makes a very convenient form 
for the questionnaire when a graded answer is desired 

Looked at from this angle, any question may be answered by 
degrees on a scale For instance, in an interest test the following 
scale was prepared for use in checking the answers to each ques- 
tion L' D D', L stands for like and D for dislike The 
exclamation point was used to denote w'armth or certainty of feel- 
ing, while the interrogation point denoted a neutral position 
Hart^ in his Test of Social Attitudes employed thiee symbols, 
-|- — , to accompany each item The -(- was to be encircled 
to indicate liking, the — to indicate dislike, and the period to 
represent neutrality Fuither, the items were printed in groups 
of fifteen to seventeen, and m any group five items ivere to be 
underlined to denote those items about which the peison ansiver- 
mg felt most strongly Finally one item was to be chosen from 
each group about which feeling was strongest of all The writer be- 
lieves It preferable, however, to provide a rating scale for each 
item so that extreme opinions 01 feelings may be expicssed in the 
case of more than one item While it is probably tiue that one item 
in each group is reacted to with moie feeling than any of the 
others, at the same time it is unwise to define the number which 
shall receive extreme ratings 

In this connection it should be emphasized that in psychological 
questionnaires every item should be answered, so that all may 
receive statistical treatment. In case the item is unknown to the 
respondent, he may choose for his answer the neutral or average 
position. The interpretation placed on scores of questionnaires 
of this type depends on the position of the score within a distribu- 

•Hart, H, “A Test of Social Attitudes and Interests,” University of lojia 
Studies in Child Welfare, Vol II, No 4 (July i,ei923) 
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tion determined in some typical gioup In order that the com- 
parison may be valid, the same number of items (all the items 
in the questionnaire) should be answeied. If this is not done, some 
kind of percentage must be used, and the interpretation of a 
percentage is not certain when the answered items are carefully 
chosen and hence are probably not of equal value. 

The direct vs. the subtle question. Some of the most successful 
questionnaires designed to measure attitude have used a disguised 
foim of question Children in answering questions seemingly in- 
tended to elicit information concerning their beliefs or preferences 
aie thcieby made to show you impoitant behavior trends in other 
directions The Watson “Suivey of Public Opinion on Some Re- 
ligious and Economic Issues” contains good examples of this In 
taking these tests the subject makes certain choices or judgments, 
thereby disclosing his tendency to radicalism or liberalism in eco- 
nomic or religious fields One of the tests in the Watson battery 
gets at these attitudes by determining the tendency toward ration- 
alization. In one test the subject is asked to judge the strength 
of arguments, thereby disclosing his tendency to disregard the 
logic of the situation in favor of a prejudice in one or another 
direction In another test a situation is brieHy described and cer- 
tain infeiences aie drawn The subject is required to judge whether 
the conclusions fairly follotv or not, thereby giving himself an 
opportunity to show prejudice m favor of one or another side 
illustrated by the inferences In still anothei test the subject is 
required to decide between the same issue when it is immediate 
and personal, and when distant and impersonal The degree to 
which the subject is inconsistent becomes a measure of the direc- 
tion and degree of prejudice 

The present writer in his “Studiousness Questionnaire” ** tells 
classes to whom the test is given that the purpose of the ques- 
tionnaire is to discover children’s interests Children never and 
teachers seldom surmise that the purpose of the questionnaire is 
to measure the tendency toward studiousness. 

It IS probable that disguised questionnaires are more valid than 
those which are straightforward in their approach. The straight- 
forward attack partakes too much of the nature of a test and per- 

* Watson, G B , The Measurement of Fairmmdedness, Teachers College Con- 
tribut'ons to Education, No 176 (1925) 

Journal of Educatioiud ‘Psychology, 19. 152-167 (1928). 
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mils the pupil to control his responses to fit his purposes The dis- 
guised questionnaiie, in which the pupil is told he is doing one 
thing, but in which the items are so selected that the result j'lelds 
a measure of something else, is the ideal situation for measuring 
conduct Since a measure of conduct should be a lecord of re- 
sponses in a prescribed situation, the test or questionnaire Itself 
often becomes a factoi sufficiently extraneous to the situation to 
spoil the results for generalizing purposes. 

Can. children introspect? When conduct questionnaires are 
given to children, the im estigator should ask himself the question, 
“Can children introspect well enough to answer these questions^” 
Introspection is a particularly difficult type of obseivation and re- 
quires considerable practice. Children do not naturally observe 
their own habits or methods of work and play Yet we assume 
in asking our questions that they have already made observa- 
tions of their conduct and are quite leady to answer our questions 

This form of retrospection, of looking back over one’s behavior 
processes and recounting them, is a feat that requires special 
training To make this more evident let the reader try to answer 
the following questions concerning golf 

Do you usually take one, two, three, or more preparatory 
swings before driving^ 

Does It bother you to have some one watch you diive^ 

Do you usually drive harder on the fust 01 last hole’ 

How much time (percentage of total) do you spend looking for 
lost balls ^ 

How long as a rule do you spend in building a tee? 

Do you look at your hands, the ball, the club, or the hole in 
putting ? 

How do you prepare for a match? 

And yet these questions do not differ in character from the 
following which weie gleaned from one of the questionnaires used 
to obtain data from high school students as to their methods of 
study. 

Do you usually read the assignment once, twice or three times? 

Does it bother you to study m a room where other people are 
talking? 

Are those lessons studied at home your hardest or easiest? 

How long, as a rule, do you spend on each lesson in the sub- 
jects you are carrying? 

How long do jrou study at a time? ® 
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Do you read your lessons aloud^ Do you close your book and 
repeat yoiu lesson to yourself^ 

Do you tiy to think out }our lesson mentally just before going 
to class ’ 

Horv do you picpaie foi an examination^ 

These golf questions seem to a golfer ridiculous Do the study 
questions seem iidiculous to a student^ The only golfer who stops 
to analyze his motements is the golf instiuctor oi the professional 
y ho wants to \\ rite a book * 

Probably the answeis to questionnaiics which require intro- 
spection by young childien aie laigcly chance answers To be on 
the safe side, one should cither icquiic that childien obseive their 
own conduct oi habits of woik for a week or so before the ques- 
tionnaiie is used, or the questions should cover familiar facts and 
everyday expeiiences which are likely to be obseived 

Can adults retrospect to their childhood? Another piessmg 
problem with regard to the use of the questionnaire method for 
the collection of facts of conduct is whether adults can accuiately 
tccall facts and experiences of their childhood and adolescence 
Years ago Thorndike answered this by saying “Adults even 
so well tiained as college seniors and even m the simplest matters 
of present objective fact such as_are involved in the questions, 
‘How tall are you^’ and ‘What is the circumfeience of your 
sistei’s head" make gross eirois The eirors inciease in number 
and amount when the rcpoit rcquiics memory, increase further 
when the fact is a leport of subjective condition, and multiply 
like bacilli when it involves the consideration of the general drift 
of a series of experiences ” 

And yet the whole theoiy of psychoanalysis is based on belief 
m the mtegrit} of recalled peisonal experiences Breuer’s “Cathar- 
tic Method,” which is the basis of Freud’s later method, is de- 
scribed by Wohlgemuth as follows “A highly unpleasant experi- 
ence occasions a psychical shock, or wound, or trauma, which 
gives rise to certain reactions The occurrence itself is forgotten 
and as far as consciousness is concerned, is apparently obliterated, 
for It cannot be recalled or revived by ordinary means It, the 

■' Symonds, P M, “Methods of Investigation of Study Habits,” School and 
Sociriv, 24 146-147 (July 31, 1926) 

I’horndikc, E L , The Original Nature of Man (Bureau of Publications, 
Teachers College, 1913), p. 32 
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memory of the ocniirpnre, continues, however, to exist uncon- 
sciously, It is jammed in, and acts like a foieign body in a wound 
It manifests itself by the continuance of the oiiginal leactions, 
nhich thus become hysterical symptoms. Bringing the memory 
of the occurrence back into consciousness is like extracting the 
foreign body from the wound The emotion, which at the occur- 
rence of the shock had had no oppoitunity to ‘woik itself off,’ 
and so was jammed in, was stiangulated, can do so now, and 
the hysteric symptoms cease to recur ’’ 

Freud’s theoiy of the impoitance of infantile reactions deter- 
mining subsequent behavior has had phenomenally wide notice, 
and lecent attempts to demonstiate the importance of eaily child- 
hood experiences in causing subsequent maladjusted behavior 
have taken the repoit of childhood experiences at face value. 
Chassell,**’'* for instance, in his “Experience Vaiiables Record,” 
piovidcs sections for noting the occurrence of expeiicnces in 
childhood, eaily teens, and leccnt (times) Chassell recognizes that 
theie IS a tendency to remember outstanding events and to neglect 
the more important habit-forming experiences of everyday life. 
He says, “An inteiesting problem lemains of how to distinguish 
betw'een rare events that weic, perhaps, strongly influential (‘trau- 
mata’), and the ever-recuriing, but less ‘affective’ day by day 
round of events ” Later on in giring correlations of reported early 
experiences with present adjustments, he says, “In considering 
this evidence, we must not fail to bear in mind that the data aie 
memories and lepoited attitudes, and that the interrelations are 
to this extent subjective, rathei than being associations between 
present observed attitudes and proved past events ” 

But this cautious attitude is by no means universally held. 
House has to explain the fact that the best adjusted of his college 
students reported the most maladjustment as children The ob- 
vious explanation that memory of childhood experiences is inac- 
curate he disclaims “Though recency may be an important factor 
in accuracy of recall, frequency and intensity are no less signifi- 
cant for the persistence of memories as witness the tenacity of 

• Wohlgemuth, A , J Critical Examination of Psycho-Analysis (Allen and 
Unw'in, 1923, The Macmillan Company, 1924.), p 46 By permission of The 
Macmillan Company, publishers 

** Chassell, J 0 , Experience Fanables a Study of the Variable Factors hi 
Experience Contributing to the Formation of Personmlity (published privately 
Rodicsicr, N. Y, 1928) ^ - 
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memoiics rooting in early childhood and preserved even in senil- 
ity \Vc might reasonably argue, then, that being older does not of 
Its own accoid militate against the retention and recollection of 
memoiies sprung from childhood situations, especially when these 
latter are universally distributed, and in no sense unique occur- 
rences. ^ 

Our conclusion Is that it is exceedingly risky to take memories 
of childhood experiences at then face value Even if they were 
real experiences, their affective accompaniments may have magni- 
fied them out of all proportion to their tiue impoitance Probably 
most of our lecalled experiences are iccalls, not of the experiences 
themselves, but of our memoiics Even so, as Chassell found, an- 
swers to questions as to childhood experiences may have high 
diagnostic significance Perhaps this significance is due to the con- 
struction which we place on our past expcnences lather than on 
the significance of the expeiiences themselves 

Questions of reaction vs. questions of environment. Question- 
naires designed to measure various types of adjustment may be 
divided into those uhich ask questions conceinmg a pei son’s reac- 
tions and those that ask questions conceinmg a person’s environ- 
ment The Woodworth “Psycho-neiuotic Inventory” illustrates the 
former This paiticular questionnaiie asks questions about the 
subject’s worries, fears, physical distuibances, and social adjust- 
ments The Chassell “Experience Vaiiables Recoid” asks questions 
of the other type, grouped about such phrases as “Alother’s Ur- 
gency toward Subject Cariying Out Hci Way,” “Alother’s Sever- 
ity in Dealing with Subject,” “Tendency of Parents to Favor Other 
Children above Subject,” etc The measure of home environment 
could also be classed with this gioup Accurate comparisons of 
these two types of questions foi measuring adjustment have not 
been made Reasoning a pi ion, it would seem as though more 
reliable results might be obtained by using the questions on the 
environment, since it seems probable that these matters have been 
the obj'ect of observation more frequently than one’s own re- 
sponses. Individuals differ, however, in their tendency to observe 
the details of their social surroundings or of their own reactions. 
Research alone can tell us which is the better type of question to 
use for this purpose. 

'House, S D, Mental Jlygiene Inventory, Archives of Psychology, No 88 
(19^7)1 P. SI. ' - ' , f 
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Validity of Questionnaires 

One should distinguish caiefully between the validity and re- 
liability of psychological qucstionnaiics Quest.onnaires, even 
though somewhat unreliable because of the inaccuracy of the rat- 
ings or judgments given as answers to the questions, may never- 
theless have high validity If this is the case, the instrument could 
be made more reliable by extending it, making it more objective, 
or standardizing its procedure Once we have demonstrated that 
we are on the right track in measuiing important attitudes or 
phases of personality, technical skill can be applied to making the 
instruments more accurate. 

Selection of items. In selecting valid items for a questionnaire 
there are two methods in vogue One consists in careful analysis 
of the field being measured, followed by assemblage of questions 
on the basis of this analysis This was the method employed by 
Heidbreder* and Freyd in measuring introveision-extrovef- 
sion Fieyd made a careful analysis of the manifestations of intro- 
veision-extroversion, drawing on the wiitings of Jung and others, 
and Heidbreder assembled the analysis into a set of questions. 

The other method is empirical; the items aie selected by try- 
ing them out on groups of subjects vaiying in the quality or 
characteristic that is to be measured Theie are two methods of 
obtaining this variation in the subjects to be used in validating 
the test Items One is to have them rated by associates on the 
quality to be measured The other is to use groups that have al- 
ready been selected socially so as to lepiesent extremes of the 
quality. The latter method, for example, was used by Lentz f in 
attempting to discover tests of delinquency He tried out his tests 
on boys in a probation school as representing the delinquent 
group and boys in a public school as representing a normal group. 
Society in this case had already made the selection. Cowderyff 

• Heidbreder, E , “Measuring Introversion and Extroversion,” Journal of Ab- 
normal and Social Psychology, 21 izo-134 (July, September, 1926) 

••Freyd, M, “Introverts and Extroverts,” Psychological Review, 31 74-87 
(1924) 

t Lentz, T F , Jr , An E-cpenmental Method for the Discovery and Develop- 
ment of Tests of Character, Teachers College Contributions to Education, No 
180 (192s) 

tt Cowdery, K M , “Measurement of Professional Attitudes,” Journal of Per- 
sonnel Research^ 5 131-141 (August, 1926). 
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used the same method in validating his intciest questionnaiie on 
doctois, lawjeis, and engineers — groups already socially differ- 
entiated. 

In this second type of item-selcction, one makes only a rough 
analysis of the chaiactciistic being measuied, admitting that 
such an analysis is piematuie before the characteiistic has been 
studied objcctucly I'o this end, lathcr a large and miscellaneous 
collection of items is gatheied together which might by chance 
show some relationship to the quality being measuied These items 
arc then tried out on the subjects rcpiesenling degrees of the 
quality in question, and those items aic letaincd which aie an- 
swered differently by the contrasting gioups, or which show a 
substantial correlation uith known dcgiees of the quality in the 
criterion group This was the method employed by Stiong in 
determining the scoring key for his inteiest questionnaiie in dif- 
ferentiating the professions which he studied 

A note of caution must be sounded heic. The validity of the 
questionnaire must be tested out on a nezo gioiip, not on that 
used in selecting the items In violation of this rule, remarkably 
high validity coefficients have been obtained by computing valid- 
ity on the same data which were used in choosing the items The 
original elation over this achievement turned to disappointment 
when the questionnaire failed utterly to show as favorable results 
in a new situation. This phenomenon, which we may call “weight- 
ing the probable error,” is to be explained as follow's The score 
made by any pupil on a test or questionnaire is the result of 



The theoretical probability curve of the distribution of scores made by an 
individual repeating a test many times. The base line represents the scale of 
scores of the test 


many factors, some of which are chance factors, so that if the 
individual were to repeat the test even under similar conditions, 
his score would not be the same as on the first trial It is known 
that If an individual were to repeat a test inEiny times his scores 
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would form a normal probability curve centering around some 
value which is his true score * 

Now when items are selected empirically, as desired above, and 
the resulting test or questionnaire has a reliability greater than 
zero, if then a score is computed for any individual on the same 
data, this score will be above or below the true score for that 
individual If his score is high, it is too high, and if it is low, it is 
too low, both in the direction which yields the most advantageous 
correlation with the criterion When the test is repeated there is a 
tendency for these scores which are too high or too low to fall 
back closei to their true scoics, which tends to lower or attenuate 
the relationship with the ciiteiion. In short, the safest way to 
avoid tilts illusory phenomenon is never to compute a validity co- 
efficient on the same gioup which was used for selecting the items 
or for determining the weight of the items 

Besides this spuriously high validity which results fiom trying 
out tests in the same situation as that in which the items w'ere 
selected, theie is some reason to believe that questionnaire results 
depend on the special situation in which they were obtained more 
than do test results Ability as shown by tests has a remarkable 
consistency, but the adjustments or interests which are tested by 
questionnaiies tend to be moie ephemeral — the ache or pain may 
have been specially noticeable tlie last few days, enjoyable ac- 
tivities may be seasonal, one is constantly adding new fiieiids, 
acquiring new ambitions, or wearing out old fears We can, then, 
never hope for the consistency in the case of questionnaiies that 
IS found in the best tests However, this matter may be ovci mag- 
nified, for on the one hand questionnaires have actually shown a 
consistency that has suipiised even those w'ho have woiked with 
them, and on the other hand these variations need not be de- 
plored, since they probably indicate important and significant 
changes in the individual. 

Veracity. The veracity with which questionnaires are answered 
is a matter of great importance. There are two cases when ques- 
tions will not be correctly answered, (a) when the person ques- 
tioned does not know the answer, and (b) when he does not want 
to give the correct answer. Before taking up the rationale of the 

•Thorndike, E L, “The Variability of an Indiviikal in Repetitions of the 
Same Task,’ Journal of Experimental Psychology, 6 101-167 (1923). 



1^0 Diagnosing Personality and Conduct 

matter of veracity, let us describe the rather meager experimental 
findings 

Persing (28) made a comparison between what students in a 
chemistiy class saj they would do in lopoitmg eiiois in the giad- 
ing of examinations rctuined to them and what they actually do 
He systematically made eriois in giading examination papers for 
a given class Some giades weie entered too high, others too low 
The students were requested to icpoit to him any errois w'hich 
they found in the marks on then papeis Giades w^eie leported 
as too low by 97 per cent, but only 9 5 per cent reported grades 
as too high Later in the year Pcrsing asked these same students 
various questions regarding their attitude toward examinations 
The results show'ed that 97 per cent said they w'ould leport low 
giades if they found them and 805 per cent would report grades 
that were too high The comparison is between this profession of 
80 5 per cent of the students with the actual 9 5 per cent who did 
so report over-grades It suggests that statements of attitude are 
to be taken at face value if they are neutral or in favor of the 
interests of those concerned, but that statements of attitude tend 
not to agree with conduct when it is a matter of opposing one’s 
own interests. 

Hartshorne and May * during the course of their experiments 
used a “Pupil Data Sheet” which included, besides a number of 
questions of a general character, twelve questions relating to 
cheating on tests in school. Samples of these are (16, p 127) • 

33. Did you ever cheat on any sort of test^ 

34. Have you cheated on such tests more than once? 

45 On some of these tests you had a key to correct your paper 
by. Did you copy any answer from the keys ? 

57. Have you answered all the questions honestly and truth- 
fully? 

These experimenters had previously administered honesty tests 
lo these same school-children to find out to what extent the chil- 
dren would cheat if given opportunity. The answers to these 
questions were then matched with the record of their actual 
cheating A he index was constructed to measure the extent of 
their lying, such that if they lied more often than not, the he index 

• Hartsho ne, H , and May, M A , Studies m Deceit (The Macmillan Com- 
pany, 1928), pp 94 S. By pennission of The Macmillan Company, publishers. 
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was positive; if they admitted their cheating, the lie index was 
negative, if the lies and truths were equally divided, the he index 
was zero This he index was applied only to those children who 
actually cheated, as “the others had nothing to he about ” In 
one school it was found that 83 o per cent lied, 15 5 per cent were 
truthful and i 4 per cent showed a balance of truths and lies. 
In another school 85 i per cent lied, 10 7 per cent were truthful, 
and 4 I per cent shoived a balance of truths and lies The con- 
clusion must be that one does not get truthful answers from chil- 
dren about their conduct when it is against their interest to tell 
the truth or when they evidently would be criticized for what they 
said. 

In another investigation Wylie (41 ) studied the validity of an- 
swers to a questionnaire given in school to determine the socio- 
economic status of the home This study was a piece of follow-up 
work growing out of the Newaik Survey of the all-year school 
Case workers were able to go into the homes of twenty-nine of 
the children and check up on the answers to their questions as 
to the home furnishings, occupation of parents, etc. 

The percentage of accuracy of some of the items is given in the 
following table • 

Table 21 

s 

Percentage of Accuracy in Answering a Questionnaire 


(from Wylie) 

Percentage of 
exact agreement 


Did mother attend high school^ .... 

929 

Number of books m home library 

24 

Telephone in home 

929 

Family possession of auto 

8z8 

Bathtub in home . 

828 

Does mother work' 

923 

Occupation of father 

62 s 

Country in which father was born 

100 

Country in which mother was born 

889 

Language mother speaks 

769 

Numbers of rooms m house 

. . 690 

Number of persons m family 

586 

Possession of library card 

*4- 


This follow-up work gives a very good picture of the accuracy 
with which children answer questions of this type. Matters of 
fact such as possession of a telephone are answered with a high 
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degree of accuracy. Questions where some interpretation is neces- 
sary or where the item is not immediately observable are answered 
less accurately. In the case of the father’s occupation some chil- 
dren see him leave home every morning but have no idea where 
he goes or what he does Others know in a vague way that he 
works in a factory or office but have little conception of his posi- 
tion or the type of work he does In other questions it is a matter 
of definition, which may perhaps account for the errors in giving 
the number of persons in the family In still other cases there 
is a definite lack of observation and we should not wonder at 
the low percentage of coirect replies to the question concerning 
the number of books in the home In this case Wylie goes on to 
show that the percentage of agreement between children and case 
uoikers, if a leeway of ten volumes is allowed, is 60 per cent and 
if a leeway of twenty-five volumes is allowed, 76 per cent 

It has many times been pointed out that one does not get the 
truth by asking many people who do not know what the truth is 
Theie is no magic in the number of persons who answer a ques- 
tionnaire. Yet how distressingly oft n a student, seizing upon a 
question to which no one knows the answer and which has an 
important bearing on educational practice, will attempt to gam a 
solution of It by sending out a questionnaire to be answered by 
persons who know perhaps even less about the matter than he 
does. Statistical methods cannot extract the truth by taking the 
average of prevailing ignoiancn as to the objectives of different 
studies, best methods of work, standards of action, and the like. 

On the other hand, Wylie’s results show that whereas one can- 
not necessarily take an individual’s answers at face value, the 
total or average of a number of answ^ers may possess a high degree 
of accuracy. For instance, although only seventeen 01 1 of twenty- 
nine children (58 6 per cent) gave the correct answer as to the 
number of persons in the family, the total number of persons men- 
tioned as members of one family or another was 202 for the pupils 
as against 204 for the case workers, representing an accuracy 
of 990 per cent. Tf errors are compensating, then the total or 
average may represent a high degree of accuracy. Again, one 
out of four children answered wrongly as to the 1 ossession of a 
library card, yet the total number of cards reported by the li- 
biary was 115 as against 131 reported by the pupils, or an agree- 
ment of 86.1 per cent. Taken in this sense, the average of a large 
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number of answers possesses considerable validity, for, statistically 
speaking, the reliability of the average may be increased by in- 
creasing the number of individuals canvassed Eriors tend to com- 
pensate when the answer is a pure matter of judgment Errors 
tend to be cumulative when there is bias or “interest” in the 
answers. 

It IS seldom possible to get at the truth by asking a question to 
determine desirable norms of conduct when the answers depend 
on the cumulative experience of those answering it, for habit plays 
a large part in determining attitudes Matters of clothing, taste, 
and style are notoriously conventional, and obviously depend on 
the conditions of previous experience One’s tastes or preferences 
should never be confused with the truth of an issue, since often 
our preferences have little relationship to logic. 

I Wylie recommends the use of inteilocking questions as one de- 
vice for obtaining greater accuracy from the results of a question- 
I naire For example, to ascertain age it is well to ask both for the 
age and for the date of biith, so that one may be used as a check 
against the other In cases where there is a deliberate attempt to 
deceive or conceal, as must have been the case with some of 
Hartshorne and May’s results, the consistency of the answers 
would be no check against their truthfulness. 

Davis (10), in studying the answers to a questionnaire to chil- 
dren on health habits, deduced the following checks of accuracy. 

(a) The answers should check with general knowledge and ex- 
perience If the answers agree tolerably well with results found in 
other surveys, they may insofar be trusted If, for instance, chil- 
dren report that they sleep on the average ninejiours a day, and 
these results are similar to the findings of other Studies, they may 
be trusted 

{b) The results should vary with varying conditions Answers 
to the question, “Arc you up at seven o’clock in the morning^” 
show that nearly 2 per cent more answer yes m May than in Oc- 
tober, as would be expected, because the days are longer and the 
mornings lighter in May than in October. 

(c) When conditions are constant, the answers also should be 
constant. To the question, “Is there a bathtub in your home?” 
the answers for grades 4, 5, 6, 7, and 8 were 86 per cent, 86 per 
cent, 87 per cent, 88 per cent, 89 per cent. This consistency is to 
be expected, as there is no factor which operates to make the 
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percentages of, homes possessing bathtubs vary from grade to 
grade. 

(d) Testimony of individual schools or groups of schools In 
a school where more than 95 per cent of the pupils were either 
Japanese or Chinese, the peicentages of tea-drinkers was 54, while 
the percentage 101 the whole city was only 19 In one seventh 
grade in a school in the best residential district, 100 per cent of 
the pupils lived m homes which had bathtubs, while in an entire 
seventh giade in an outlying district the percentage was 28 This 
is the kind of consistency in the answers for groups which tends 
to prove that the leturns possess a high degree of accuracy 

In psychological questionnaires (for example, the Woodworth 
Psychoneurotic Inventory) the matter of veracity does not enter 
so strongly as a disturbing factor. These questionnaires are given 
under standardized conditions, and their significance is determined 
by their correlations with other factors If certain pupils do not 
choose to answer the questions truthfully, this fact is taken care 
of by the correlations It is not the question whether the pupil 
will tell the truth or not, but how his answers compare v'’th those 
of some one else when asked under standardized condi.’ons, and 
the w'ay these differences are related to other factors. 

As measuies of attitudes, questionnaires have been under fire 
in this matter of veracity. There may oftentimes be a tendency 
to answer for effect Vfe have referred (pp 150, 15 1) to the studies 
of Persing and Hartshorne and May to show that under certain 
conditions there is divergence between what pupils say and what 
they do The common impression is that a person will act to 
further his own interests, and when asked a question, will reply 
m order to produce the most favorable impression However, the 
writer has long been of the impression from casual observation 
that pupils in school will answer these questionnaires rather truth- 
fully One reason for this is that pupils confuse these question- 
naires with tests, and with the general school attitude of trying 
to do their best on a test they seriously try to give truthful 
answers on questionnaires concerning even their most personal 
affairs. Another reason is that the same motive that causes a per- 
son to make a good impression on a questionnaire would also 
make him wish to give a good impression in conduct In his book 
The Nature of Conduct (pp 231-233) the author expresses him- 
self as follows on this point: , 
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“Verbal expression does not necessarily truthfully portray inner 
facilitation oi resistance In cases of dissimulation one may really 
speak contrary to actual readiness Boasting is an example, ra- 
tionalization IS anothei A child may claim to be unafiaid, for 
instance, of going on a boat or into a daik 100m but mav balk 
at the actual act Inexperienced persons, on this account, distiust 
those questionnaires which have recently been devised to measure 
a person’s interests or attitudes with special reference to his 
progressiveness or conservativeness The objection raised is that 
It IS easy to state one’s beliefs or interests one way or the other 
regardless of one’s real interests or beliefs The answers may be 
guided by one’s opinion as to how other people wull judge one by 
them It happens, however, that in actual practice such dissimula- 
tion takes place only infiequently One’s natural (although prob- 
ably learned) reaction is to make the verbal statement harmonious 
with the actual readiness or unicadiness It would also seem that 
if a person is influenced by what other people say or think, he is 
usually influenced not only in his words but also in his in- 
teinal readinesses and unreadinesses Most persons have learned 
to make their words and thoughts match their actions It is the ex- 
ception for a tcetotalei to say that he does not believe in pro- 
hibition Most men w'ho say they disbelieve in prohibition will 
suit action to the word on the pioper occasion A poll or stiaw 
ballot on a group’s attitude toward prohibition is apt to give a 
pretty accurate pictuie not only of what the group professes but 
of their actual conduct It is for this leason that these question- 
naires designed to tap attitudes have pioved themselves rather 
leliable and significant indicators of conduct trends ” 

Recent experimental data published by the Character Educa- 
tion Inquiiy tend to substantiate the above statements. Quota- 
tion will be made only from the conclusions as stated by May 
and Hartshorne, as a description of their experimental technique 
would be somewhat involved- 

“Over against the fact that 89 per cent (of children) stated 
that It was their duty to read the Bible every day (apparently a 
conventional response) must be set the fact that very few of 
these children changed their answers when confronted with the 
answer sheet which showed that the ‘standard’ did not regard it 
as their duty to do so ... We may say, on the one hand, that 
the moral knowledge scores, therefore, which differ widely from 
child to child, are not merely efforts to repeat the school stand- 
ards, but represent something more fundamental Or, on the 
other hand, we may say that the tendency to make a good ap- 

*Symonds, P M, The Nature of Conduct (The Macmillan Company, 1928) 
By permission of The Macmillaa Company, publishers 
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pearance does not correlate with moral knowledge . . . The con- 
sistency of results, as found by coi relating comparable forms of 
the same test, indicates that nhat is stated as moial knowledge has 
a certain coherence and stability, and whether lived up to or not, 
points the way to action that is regarded as proper ” * 

With refeience to the relation of verbal attitude and conduct 
these authors say » 

“Ninety-one ner cent of those who say it is all right to let 
another pupil cop}' youi work and hand it in as actually his own 
cheated themselves One hundred per cent of those preferring to 
smash the slot machine to recover their lost nickel actually cheated 
on a test Ninety-three per cent of those who thought it right for 
John to cheat in older to help his class win actually cheated 
themselves It is noteworthy that these high agreements among 
the cheaters are in legard to cheating in two cases, to property in 
the third, and not tn any instance to other types of behavior. 
This IS somewhat surprising, since one would not expect a cheater 
to wear his heait on his sleeve ” 

The present writer’s own interpretation of the data given in the 
article would not be co strongly in favor of the position quoted 
For example, there w'ere many moie pupils who said it was wrong 
to cheat and did cheat than said it w'as right to cheat and did 
cheat The numbers on which the percentages are based are very 
small The zero correlation which these investigators found be- 
tw'een moral knowledge and conduct proves nothing, except that 
both knowledge and conduct are vei}* specific 

The safest plan is to wait for adequate experimental results be- 
fore deciding as to the dependence one can place on the truthful- 
ness of answers to a questionnaire Eventually we shall know 
under wdiat conditions the truth can be expected and when it can- 
not be expected 

Methods of Scoring Questionnaires 

In the fact-finding questionnaire there is no occasion for sum- 
ming the answers. The answer to each question conveys an im- 
portant Item of information to be tabulated separately. In the 

Hartshorne, H , and May M A , “Testing the Knowledge of Right and 
Wrong,” Religious Education, 21 419 f (August, 1926) 

** Ilartshorne, H, and May, M A, in Religious Education, 21 627 (Dec, 
1926). - - - _ ; - 
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psychological questionnaire, where all of the questions attempt to 
get reactions on various phases of human attitude or interest, it 
is customary to sum the answers For instance, if the questions 
are designed to show the subject’s reaction to various social issues, 
It IS possible to sum the responses so as to show a radical or 
conventional tendency. In the interest questionnaires one can 
sum the answers so as to get a measure of interest in broadly 
designated activities or occupations Certain investigators, to be 
sure, have refused to sum their answers Reasons given are that 
each question is valuable on its own account, that to give a single 
score would obscure differences in the way in which individual 
questions were answeied, and that a total score would have no 
meaning, since it stands for the answers to a miscellaneous group 
I of questions Much depends on the use to be made of the results 
of the questionnaire If it is to become a basis of group discussion, 
or to reveal to an individual his own idiosyncrasies, or to guide 
a doctor in giving advice or administering treatment, then the 
answer to each question is of value But if it is to be used to 
measure attitude in some general sphere such as racial relations, 
religion, or the economic order, or to measure interest in some 
phase of life, or to measure the degree to which successful adap- 
tation has been made to some part of The environment, then there 
is every reason for obtaining a total score. It is a satisfaction to 
report that sufficiently high correlations have been found between 
the halves of questionnaires designed to measure such characteris- 
tics as psychoneurotic adjustment, introversion-extroversion, in- 
feriority complex, fair-mindedness, studiousness, and ascendancy- 
submission to give us firm ground for believing that such qualities 
are real characteristics of the personality and that they may be 
measured by the questionnaire-rating scale method. 

The method of scoring a questionnaire, once it has been de- 
cided that scoring is advisable, is laigely determined by considera- 
tions similar to those described in selecting the items. One may 
use a common-sense scoring method which gives credit only for 
answers which apparently are in the direction of or show harmony 
with the confact or characteristic being measured Such a key 
may be derived most simply from a prima facte consideration 
of the questions Better than this is to obtain the direction of 
the questions by a consensus of judgment of several competent 
persons. The present writer, for instance, in his “Social Attitjides 
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Questionnaire” * determined the liberal side of questions on social 
issues by asking five experts to answer the questions with what 
they considered to be the libeial position Items on which there 
was no practical agieemcnt were discarded 

Another scoring method consists in trying out the questions 
first and determining the key from the results This may be ac- 
complished m two ways The first of these is to tabulate the 
responses on two sharply differentiated groups already separated 
either by social selection or by ratings on the quality or charac- 
teristic in question The other method is first to score the ques- 
tionnaire, using a common-sense or a -prion key, and then to check 
the answers to each item against high or low total scores on the 
questionnaire. This can be done in various ways, of which an 
elaborate one is to compute the bi-serial r for the answers to each 
item against the distribution of scores. A simpler method is to 
compute the number of yes answers for sub-groups making above- 
average and below-average scores in the group originally meas- 
ured In this case the key should give credit for the answer yes 
if a laiger percentage of high-scoiing than low-scoring persons 
answers it that way. Where the group is large, one may abridge 
the work by using the upper and lower 25 or 20 per cent of the 
group.''"^ 

In case more than a twofold or dichotomous response (eg., 
Yes, No') IS obtained on a question, the problem arises of as- 
signing credit for different answers This pioblem becomes acute 
when a graphic rating scale is used for entering the answer, as 
in the “Colgate Personal Inventory.” Here one is confronted with 
the task of evaluating the position of the rating for the purpose 
of using it most effectively in determining a total score on the 
questionnaire. Laird’s f solution was to determine a point on each 
rating scale above which lay 25 per cent of the answers in the 
direction of introversion and then merely to give one point credit 
for any answer which lay inside this extreme area. This method, 

* Journal of Educational Psychology, 6 316-322 (1925). 

'■•Kelley has determined that the optimum differentiation is obtained when 
answers made by the 27% of the group making the highest scores are compared 
with the answers made by the 27% of the group making the lowest scores. 
See Jensen, M B , Objective Diferentiation Between Three Groups in Education, 
Genetic Psychology Monographs, vol 3, no s, 1928 

t Laird, D A , “Detecting Abnormal Behavior,’’ Journal of Abnormal and 
Social Psychology, 20- 128-141 (1925). , 
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however, seems to the present writer to throw awaj most of the 
rewards to be had for the tiouble of using a giaphic scale Some 
such simple device as giving one, two, three, four, or five credits 
according to the fifth of the scale in which an answer lies seems 
an easy way of making more effective use of the sharper diffeien- 
tiation which the graphic scale affoids Freyd"* met the same 
problem when he had to construct a scoring scheme for answers 
to his interest questionnaire, which called for encircling L', L, ?, 
D, or D! His original decision to give credit for only one of the 
symbols encircled in each item was not followed by subsequent 
workers Exactly what methods strike the best balance between 
making the most effective use of the fineness of discrimination 
which the self-ratings afford and avoiding unnecessary difficulty 
in scoring is a task for fuither reseaich to decide If Laird’s solu- 
tion of merely giving a credit for answers m certain areas of each 
scale IS used, the method of answering the questions could surely 
be much simplified without becoming less valuable Likewise if 
only one or two of the answers is to be scored when a five-choice 
alternative is offered, the questionnaire would probably yield al- 
most as satisfactoiy results if it offered less than five alternatives. 
In general we should offer no more possible alternatives as an- 
swers than it is decided to use in scoring. 

This leads us to the matter of weighing items. It seems only 
common sense that answers to the more valid items should have 
greater significance and perhaps should be given more weight than 
answers to less valid items Two methods for doing this have been 
suggested One developed by Kelley and described by Cowdery ** 
is to be used where there are dichotomous answers. The weight 
to be used is 

{I -4,^)0 

where 

ad — be 

^-y/(^a+c){b + d){a-\-b)(,c + d) 

where a and b are the numbers answering yes and no to the ques- 
tion; and c and d are the numbers in each of the two groups 

•Freyd, M, “The Personalities of the Socially and Mechanically Minded,” 
Psychological Monographs, 33, No 4 (1924) 

••Cowdeiy, K M, “Measurement of Professional Attitudes,” Journal of Per- 
sonnel Research, $ 131-141 11926-1927). * 
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between which the questionnaire is expected to differentiate. 
Strong discarded this method m his woik because he found the 
results correlated almost perfectly (except for extreme values) 
with those to be had by merely taking differences between percent- 
ages of subjects answering the questions correctly Differences be- 
tween percentages were then coded down into a set of weights to 
be used in scoring. 

Watson * has used an adaptation of the McCall-Long method 
of weighting scores He describes the steps of his plan as follows. 

“i. Build the criterion 

“2 List across the top of a sheet every possible response to 
every element in the test 

“3. Tabulate the answers for each person in the criterion group, 
and enter his criterion score in the column corresponding to every 
answer which he has made Thus, suppose we had an individual 
whose criterion score in the test questions was 95 Suppose this 
person answered true to the first question, true to the second, 
checked the second alternative in the multiple-choice question, 
and ranked the ranking questions in the ordei 5, 3, 2, i, ,4 Then 
95, the individual score uould be listed in every column corre- 
sponding to the response he made The same would be done with 
the total scores made by other individuals 

“4 Add each column and divide by the number of entries in 
the column This will give the average value of each answer in 
terms of criterion score At the same time add the column of 
criterion scores so that we may find the average of the criterion 
scores. 

“S Subtract the average criterion score m algebraic fashion 
from the average of each column The resulting figure will give, 
if positive, the amount of credit that ought to be given for that 
answer; if negative the amount which should be subtracted from 
the score for any person who makes the particular response.” 

Watson adds the following comment, 

“If the test be scored in this fashion it will give high scores to 
the kind of people who stood high in the criterion, and it will give 
low scores to the kind of people who stood low in the criterion. 
Any elements which were answered in the same fashion by every- 
one receive zero rating. Any answers which were answered by 
nobody are eliminated Any dements which were answered equally 
often by people who stood high in the scale and people who 
stood low in the scale receive zero value. The difference between 

* Watson, G B , Expenmentation and Measurement in Religious Education 
(Association Press, 1927),'' p. ISS f. 
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the amount of positive credit which can be given for the right 
answei to a test question, and the amount of penalty which is 
given for the poorest answer, is a rough indication of the value of 
that element.” 

Two tests should be applied to any system of weighting items 
before the scheme is adopted. One test is whether the weights 
are consistently retained when derived from a new group. Unless 
the weights hold in a new situation, they should not be used for 
scoring The other test is whether the correlation with a criterion 
using the weights is sufficiently superior to the correlation with- 
out using the weights If the difference in correlations is insig- 
nificant, there can be no advantage in using the more compli- 
cated scoring device, because, as a general principle, a scoring 
scheme should be as simple as possible when it may be hoped 
that the test will be popularly used. 

Scores of amount and of direction. It is possible to score cer- 
tain types of questionnaire in two or more ways so as to bring 
out different characteristics of the response. Where the subject 
answering the questions is instructed to mark only those items 
about which he has definite opinions or feelings, the number of 
responses made has a significance This type of score is used by 
Pressey* with his X-0 tests — Pressey calls it an “affectivity” 
score Watson** takes the number of extreme answers as a 
measure of prej'udice m his tests of fair-mindedness Besides this 
total score, it is possible to take into account the direction of 
the score by considering the direction of the answer to certain 
groups of Items. This type of score Pressey calls an “idiosyn- 
crasy” score and by means of it he measures such abnormal re- 
sponses as disgust, fear, sex, and self-feeling Watson also uses 
a differential score to measure direction of prejudice m such 
fields as the economic order, race problems, and religious issues. 

Reliability of the Questionnaire 

The problem of accuracy of measurement in the physical sci- 
ences come down to the relatively simple matter of noting the 

•Pressey, S L, and Chambers, O R, "First Revision of a Group Scale for 
Investigating the Emotions, with Tentative Norms,” Journal of Applted Psy- 
chology, 4. 97-104 (1920) 

•• Watson, G B , The Measurement of Fairmmdedness, Teachers College 
Contributions to Education, No 176, pp 13-IS * 
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variations resulting from repetitions of a measurement In psycho- 
logical testing, however, it is impossible to obtain an exact repeti- 
tion because, once a person has taken a test, his familiarity with it 
precludes setting up exactly the same conditions The problem is 
solved, howevei, by recognizing that any test represents but a 
sampling of the field tested, and the repetition is accomplished by 
testing with another sample The correlation between the scores 
on these two supposedly equivalent tests given at different sittings 
is used as a measuie of icliability and is called the reliability 
coefficient 

In detei mining the reliability of questionnaires it is often diffi- 
cult to gather another set of questions which would represent a 
sampling of the field equivalent to the original one A method of 
circumventing this difficulty is to coi relate the scores on randomly 
chosen halves of the questionnaire, afterwards correcting this re- 
liability coefficient of one half against the other half by the 
Biown-Speaiman formula, which enables one to estimate the cor- 
relation of die whole questionnaire against another similar one. 
This method does not take into account differences in the subject 
between two sittings It is really only a measure of the internal 
consistency of the questions, a very important consideration, how- 
ever, when one is mcasuiing in a new field. The method of re- 
peating the questionnaire has sometimes been employed, but no 
one can know to what degiee the questions are remembered and 
deliberate changes made in the answers Cady,* following Kelley’s 
suggestion, has used the ingenious device of rewriting the ques- 
tionnaire so that the answers to all questions are reversed. For 
instance 

(First form) Do you ever have a strong desire to set fire to 
something ’ 

(Second form) Do you dislike the idea of setting fire to some- 
thing to see It burn^ 

Although on the surface of things this seems to ensure com- 
parability of the two forms, it really produces a new set of ques- 
tions Concerning this Cady states, 

“A lack of clearness and directness in many of the questions' 
will be apparent It is very difficult to change a statement from 

*Cady, V M, The Estimation of Juvenile Incorngibility, Journal of Delin- 
quency Monographs, November 2, 1923. 



The Questionnaire 163 

the natural direct way of expression into a question that says 
the same tiling, yet lequiies an opposite leply The leault is a 
number of expressions which are piactically double negatives If 
they are not of this natiiie, they often do not mean quite the same 
thing, though perhaps neai enough foi piactical puiposes A 
portion of these difficulties was overcome by explanations made 
to the subjects which helped to clear things up for those of less 
intelligence and yet did not introduce new meanings . . It is 
open to question, however, whether this device of a reversed ques- 
tion would be as valuable as, say, 100 carefully selected questions 
divided into two forms and given at different periods ” 

A few typical reliabilities of questionnaires taken from the ex- 
perimental literature are as follows: 
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Symonds Studiousness Questionnaire Symonds 213 high school 100 Split halves cor- 

seniors reeled by Spear* 



Zeleny Trufr-False Opinion Test Zeleny 35 college atu- 68 Correlation of tiro 

dents parts corrected by 

* Spearman-Brown 

Zeleny True-False Opinion Test Zeleny $8 college sti}- 68 Correlation of two 

dents parts corrected by 
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These reliability figures compare very favorably with other 
similar reliabilities obtained fiom iccall multiple-response and 
true-false tests Ruch *■ gives the follow mg tabic which summarizes 
work on reliability with objective tests. 

Table 23 

Reliability of Typicvl Objectiye Subject-Matter Tests 
(from Ruch) 


RELIABILITY 

OF 100 ITEMS 





Ewe- 

Three- 

Two- 

True- 


Recall 

response 

response 

response 

false 

Toops 

76+ 

715 



673 

Ruch-Stoddard 

896 

866 

748 

849 

714 

Ruch-Stoddard Social Study 

9 SO 

882 

890 

843 

837 


Probably the same factors influence the reliability of question- 
naires that influence the reliability of tests, with, however, certain 
factors assuming particular importance in determining the relia- 
bility of the questionnaire It has been found that questionnaires 
are more reliable for adults than for childien There aie three 
factors which may go toward explaining this In the first place, 
since adults are probably better able to observe and introspect 
than arc children, their answers to questions should be more 
accurate and consistent, as is observable, for example, m the 
Woodworth Psychoneurotic Inventory In the second place, it is 
probably true that most personal experiences, characteristics, etc , 
are better developed in adults than in children, and hence stand 
in a clearer light and are not so difficult to distinguish This 
phenomenon would apply particularly to interest questionnaires 
A third factor is the matter of intelligence necessary to under- 
stand the Items For younger children, many of the questionnaires 
become intelligence tests or vocabulary tests because of the in- 
herent difficulty of the concepts and the language employed in 
them. Children would have difficulty in answering the following 
lines taken at random from the Pressey X-0 Tests: 

extravagance sportiness boasting deformity talking-back 

fire _ nervousness germs insult disfigurement 

Mowgli Tarzan D’Artagnan Hamlet Gallahad 

* Ruch, G M , and others, Objective Examination Methods in the Social 

Studies (Scott, Foresman akid Company, 1926), p. 83. 
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In cases where the issue becomes one of not understanding the 
item, the score tends to reduce to 50 per cent of the items an- 
swered, and chance becomes the siipieme factor in determining 
the score. 

Norms 

Shall norms be given for questionnaires? By all means, since 
norms for typical groups which have answered the questionnaire 
help to interpret the results of measurement Only by considering 
the present case in the light of previous experience can judgment 
be made as to the significance of the results Besides the average 
score made by the group, percentile scores should be given, so 
that one may know what percentage of the individuals in the group 
tested fall above a certain score In many cases norms or per- 
centile scores for contrasted groups are necessary so that one may 
get at the typicality of a given score for a given group For 
instance, norms should be available for the Woodworth Psycho- 
neurotic Questionnaire in the case of normal groups, psychoneu- 
rotic groups, and delinquent groups, etc, thus permitting esti- 
mates of the probability that any score will fall in any of those 
groups Likewise norms and distributions for the Cowdery Vo- 
cational Interest Blank should be provided for each group to be 
differentiated Age norms seem of less value in the case of ques- 
tionnaires than in the case of tests. The reason for this is that 
in the former it is difficult to perceive a real growth or change 
due to age Whatever changes do occur during the ages of growth 
can probably be explained as due to an increase in ability to read 
or to greater comprehension of the situations described 

We should be on our guard against making a false interpretation 
of the meaning of norms. A common error is to accept norms as 
standards or goals, an error quite likely to be made with the 
results of attitudes questionnaires An individual tends to feel a 
certain degree of satisfaction if he reaches the average attitude 
of his group. Though to think or believe like the group we re- 
spect IS all that we usually desire, the group may not be right. 
Perhaps we ought to be more open-minded or more tolerant than 
any group measured. Science can never tell us what we ought to do 
merely from studying that which has been done. However, in 
reaching a decision as to what should be done, a knowledge of 
th? existing state of affairs is helpful. . 
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Types of Distributions Yielded by Questionnaires 

Most questionnaires yield distributions which are practically 
normal It is significant that questionnaires designed to show char- 
acteristics of adjustment such as psychoncurosis, introversion-ex- 
troversion, supeiiority-mferionty, and ascendancy-submission all 
show approximately normal distributions Most persons, then, are 
not introvert or extrovert — they are ambivert Most persons strike 
a balance between feelings of superiority and inferiority or be- 
tween reactions typical of ascendancy and submission. These 
dichotom.ous or twofold classifications, which are so common in 
our everyday interpretations and generalizations, are not borne 
out by the test results The average, however, may not be exactly 
at the dividing point Heidbredei found that the normal person 
tended to answer more of her questions like the extroverts than 
like the introverts. It is more unusual to be introverted than 
extroverted Furthermore, even though a distribution is normal 
for other characteiistic diffeitnces, and few people are extreme, 
one type of adjustment is usually raicr than the other. 

Even in our attitudes and interests we tend to show a normal 
distribution Few boys are extremely studious, and few are ex- 
tremely the opposite — the majority are tolerably studious Most 
persons stiike a balance between the radical and the conservative 
We aie all radical on some issues and conservative on others. 
Since for most persons these tendencies balance up, it is the out- 
standing or peculiar person only who is marked as the ultra- 
radical or the ultra-conservative 

One exception to this was discovered by Allport* who studied 
attitudes by asking students to select from a submitted list the 
statement regarding an issue w'hich was most acceptable to them. 
He found that on some issues — prohibition, for example — there 
was a definite cleavage of opinion, with few taking a middle 
ground, such that they could be called neither wet nor dry m 
sentiment. 

But Thurstone,** using the same method, believes that this 

* Allport, F H, and Hartman, D A, “Measurement and Motivation of 
a Typical Opinion in a Certain Group,” American Political Science Review, 
19 735-760 (192s) 

*• Thurstone, L L, “Attitudes Can Be Measured,” American Journal of 
Sociology, 33 529-554 (Jan , 19*8). 
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characteristic in Allport’s results was due to his scale It was 
thick with Items in the middle and thin with items on the ends. 
That IS, Thurstone thinks, the form of the distribution of atti- 
tudes IS determined by the scale through the selection of items 
included in it There is just a touch of inconclusiveness in the 
argument here If we start with the assumption that the distribu- 
tion should be noimal, and then make up the scale to fit the as- 
sumption, naturally the results of using it will show a normal 
distribution. This problem is best answered by considering the 
distribution formed by submitting a long list of items chosen by 
random sampling Such distributions are usually normal. 

Sex differences. Most questionnaires, unlike tests, show distinct 
sex differences Women, for instance, have been found by Laird * 
to be more introverted than men Symonds ** found women stu- 
dents more conservative than men students Indeed, the differ- 
ences are so apparent in interest questionnaires that separate sets 
of questions must be assembled to measure adequately the in- 
terests of the two sexes — a finding markedly out of line with the 
results of tests, which seldom show clearly marked sex diffeiences 

Racial differences. The Woodworth Psychoneurotic Inventory 
has yielded distinct racial differences f Investigators, however, 
incline to the opinion that these differences, instead of indicating 
fundamental trends in racial constitution, are really an expression 
of varying mores, customs, and habits of thought and action, a 
conclusion which, if coirect, would imply that the results of any 
questionnaire of adjustment, interests, or attitudes must be in- 
terpreted in the light of an individual’s or a group’s background. 
Differences disclosed by questionnaires probably reflect the ef- 
fect of environment, particularly the social environment, rather 
than some inborn native tendency of the personality 
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Chapter V 

ADJUSTMENT QUESTIONNAIRES 


The Psychoneurotic Inventory and Its Derivatives 

A FORM of questionnaire for measuring psychoneurotic ten- 
dencies which has enjoyed extensive use, had its incep- 
tion in 1917 during the World War At that time a method 
was needed for diagnosing the ability of men to adjust them- 
selves satisfactorily to the stresses and strains of military life. 
Since it was obvious that the psychologists who were available 
could not possibly interview personally every drafted man, a 
paper-and-pencil questionnaire was drawn up by Woodworth, 
then chairman of the Committee on Emotional Fitness appointed 
by the National Research Council Woodworth studied the symp- 
toms of men who had difficulty in adjusting themselves to trying 
conditions, using such sources as McCuidy’s JFar Neuroses Over 
200 questions were assembled, each to be answered by yes or no. 
These were tried on Columbia College students and drafted men 
Questions for which the peicentage of “unfav^orable” replies was 
large were either omitted or “stiflened,” so that as a result of this 
preliminary experience the list was reduced to 116. Although 
the armistice came before this final list could be extensively 
studied, a report of its use in army hospitals is contained in H L 
Hollingworth’s Psychology of the Functional Neuroses We shall 
present Woodworth’s list of 116 questions, as it forms the basis 
of subsequent work. The unfavorable answers are italicized The 
score IS the number of unfavorable responses. This questionnaire, 
originally dubbed “Personal Data Sheet” in order to allay sus- 
picions as to Its real purpose, has been called in psychological 
literature the “Woodworth Psychoneurotic Inventory.”* (See ii, 
pp. 171-176) 

*Publr‘!hed by C H Stoelting Company From Franz, Handbook of Examina- 
tion Methods. By pcimissioli of The Macmillan Company, publishers. 

»74 
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Woodworth Psychoneurotic Inventory 


1 Do you usually feel well and strong? 

2 Do you usually sleep welD 

3 Are you frightened m the middle of the night ^ 

4 Are you troubled with dreams about your work^ 

5 Do you have nightmares^ 

6 Do you have too many sexual dreams^ 

7 Do you ever walk in your sleep? 

8 Do you ever have the sensation of falling when 
going to sleep ^ 

9 Does your heart ever thump in your cars so that 
you cannot sleep ^ 

10. Do ideas run through your head so that you can- 
not sleep ^ 

11 Do you feel well rested in the morning^ 

12 Do your eyes often pain you^ 

13. Do things ever seem to swim or get misty before 
your eyes^ 

14 Do you often have the feeling of suffocating^ 

15. Do you have continual itching in the face^ 

16 Are you bothered much by blushing^ 

17. Are you bothered by fluttering of the heart ^ 

18 Do you feel tired most of the time? 

19 Have you ever had fits of dizziness? 

20 Do you have queer, unpleasant feelings in any part 
of the body? 

21 Do you ever feel an awful pressure in or about the 
head? 

22 Do you often have bad pains in any part of the 
body? 

23 Do you have a great many bad headaches? 

24 Is your head apt to ache on one side? 

25 Have you ever fainted away? 

26 Have you ojten fainted away? 

27 Have you ever been blind, half-blind, deaf, or dumb 
for a time? 

28 Have you ever had an arm or leg paralyzed? 

29 Have you ever lost your memory for a time? 

30. Did you have a happy childhood? 

31 Were you happy when i4,to 18 years old? 

32 Were you considered a bad boy? 

33 As a child did you like to play alone better than to 

play with other children? • 

34 Did the other children let you play with them? 

35 Were you shy with other boys? , 

36. Did you ever run away from home? 


yes 

no 

yes 

no 

yes 

no 

yes 

no 

yes 

no 

yes 

no 

yes 

no 

yes 

no 

yes 

no 
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37. Did you evei have a strong desire to run away from 
home? 

38. Has your family always treated you right? 

39 Did the teachers in school generally treat you right ^ 

40 Have your employers gcneially treated you right ^ 

41 Do you know of anybody who is trying to do you 
harm? 

42 Do people find fault with you more than you de- 
serve? 

43 Do you make friends easily? 

44 Did you ever make love to a girl? 

45 Do you get used to new places quickly? 

46 Do you find your way about easily? 

47 Does liquor make you quarrelsome? 

48 Do you think drinking has hurt you? 

49 Do you think tobacco has hurt you^ 

50 Do you think you have hurt yourself by going too 
much with women ^ 

51 Have you hurt yourself by masturbation (self- 
abuse) ? 

52 Did you ever think you had lost your manhood? 

S3. Have you ever had any great mental shock^ 

54 Have you ever seen a vision? 

55. Did you ever have the habit of taking any form of 
“dope”? 

56 Do you have trouble in walking in the dark? 

57 Have you ever felt as if some one was hypnotizing 
you and making you act against your wilH 

58 Are you ever bothered by the feeling that people are 
reading your thoughts^ 

59 Do you ever have a queer feeling as if you were not 
your old self’ 

60 Are you ever bothered by a feeling that things are 
not real? 

61. Are you troubled with the idea that people are 
watching you on the street^ 

62 Are you troubled with the fear of being crushed in 
a crowd? 

63 Does It make you uneasy to cross a bridge over a 
river? 

64 Does It make you uneasy to go into a tunnel or 
subway? 

65 Does It make you uneasy to cross a wide street or 
open square? 

66 Does It make you uneasy to sit in a small room with 
the door shut? 

67 Do you usually know just what you want to do? 

68. Do you worry too much about little things? 
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69. Do you think you worry too much when you have 
an unfinished job on your hands ^ 

70 Do you think you have too much trouble in mak- 
ing up your mind? 

71. Can you do good work while people are looking 
on? 

72. Do you get rattled easily? 

73. Can you sit still without fidgeting? 

74 Does your mind wander badly so that you lose 
track of what you are doing ^ 

75 Does some particular useless thought keep coming 
into your mind to bother you? 

76. Can you do the little chores of the day without 
worrying over them? 

77 Do you feel you must do a thing over several 
times before you can drop it^ 

78 Are you afraid of responsibility^ 

79 Do you feel like jumping off when you are on high 
places ^ 

80 At night are you troubled with the idea that some- 
body IS following you'' 

81. Do you find it difficult to pass urine in the pres- 
ence of others^ 

82 Do you have a great fear of fire? 

83. Do you ever feel a strong desire to go and set fire 
to something? 

84. Do you ever feel a strong desire to steal things ^ 

85 Did you ever have the habit of biting your finger- 
nails? 

86. Did you ever have the habit of stuttering^ 

87. Did you ever have the habit of twitching your 
face, neck, or shoulders^ 

88. Did you ever have the habit of wetting the bed? 

89. Are you troubled with shyness? 

90 Have you a good appetite^ 

91. Is It easy to make you laugh? 

92. Is it easy to get you angry? 

93. Is It easy to get you cross or grouchy? 

94. Do you get tired of people quickly? 

95. Do you get tired of amusements quickly? 

96. Do you get tired of work quickly? 

97. Do your interests change frequently? 

98. Do your feelings keep changing from happy to sad 
and from sad to happy without any reason^ 

99 Do you feel sad or low-spirited most of the time^ 

100. Did you ever have a strong desire to commit 

suicide^ j 

101. Did you ever have heart-disease? 
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102 

Did you ever have St Vitus’s dance? 

yes 

no 

103 

Did you ever have convulsions^ 

yes 

no 

104 

Did you ever have anemia badly ^ 

yes 

no 

105 

Did you ever have dyspepsia’ 

yes 

no 

106 

Did you cvei have asthma or hay fever ^ 

yes 

no 

107 

Did you ever have a nervous breakdown^ 

yes 

no 

108 

Have you ever been afiaid of going insane^ 

yes 

no 

109 

Has any of your family been insane, epileptic, or 




feeble-minded ^ 

yes 

no 

no 

Has any of your family committed suicide^ 

yes 

no 

III. 

Has any of your family had a drug habit ^ 

yes 

no 

112 

Has any of your family been a drunkard? 

yes 

no 

113 

Can you stand pain quietly^ 

yes 

no 

1 14. 

Can you stand the sight of blood ^ 

yes 

no 

”S 

Can you stand disgusting smells? 

yes 

no 

1 16. 

Do you like outdoor life^ 

yes 

no 


A rough classification of these questions may be made as 
follows : 

Table 24 

CiAssmcATioir of Items m the Woodworth Psychoneotottc Iiti'entort 


Physical symptoms, pains, weariness, incoordinations . 28 

Adjustment with the environment 20 

Fears, worries . 16 

Unhappiness, unsocial and antisocial moods and conduct 16 
Dreams, phantasies, sleep disturbance , 10 

Reactions to drink, tobacco, drugs, sex 7 

Mental symptoms . . 6 

Vacillations . 5 

Compulsions 4 

Questions about one’s family . 4 


116 

Just as intelligence tests were found to be useful in classifying 
and grouping school children on the basis of ability, so, it was 
suggested, perhaps the Woodworth Psychoneurotic Inventory 
might be serviceable in identifying children who have difficulty 
in making school or home adjustments Accordingly in 1921 Miss 
Ellen Mathews (20) used the questionnaire with the purpose in 
mind of adapting it for school-children. The original Woodworth 
list was found unsuitable, however. Some of the questions were 
unintelligible to children Others caused too great embarrassment, 
restlessness, or tittering in the group, and in one private school for 
girls the use of the qbestionnaire caused considerable excitement 
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among the parents Mathews revised the questionnaire by drop- 
ping some questions and adding others, so that her list of seventy- 
fi\ e questions is one that may be used with children as young as 
twelve or thiiteen years of age"* 

Cady (5), compiling, in 1923, a gioup of tests to estimate 
juvenile incorrigibility, selected the Woodworth questionnaire as 
one of his battery He argued that piobably much juvenile de- 
linquency was due to imperfect adjustment to the demands of life 
and that the Woodworth questions should aid in revealing the ten- 
dency to make faulty or inadequate adjustments. Cady selected 
questions from the Woodworth questionnaire and from another 
revision by Johnson (16) and added twelve questions w'hich bear 
distinctly on the habits and expressions of the behavior of m- 
corrigibles Cady prepared an alternative form of his questionnaire 
by revising all questions so that an answer of yes in the first form 
would correspond to an answer of no in the second foim and vice 
versa. This enabled him to determine the consistency of response, 
and also the reliability of the questionnaire, furthermore, by 
using the combined result of both forms the reliability was in- 
creased. 

In Cady’s report he states that his revised questionnaire con- 
sists of fifty-nine questions Teiman (29) prints a list purporting 
to be the questionnaire in his Genetic Sttidies of Genius, Vol- 
ume I, containing eighty-five questions Twelve of these are 
inserted for padding to lull the suspicions of the subject as to the 
puipose of the test. This revision, known as the Woodworth-Cady 
questionnaire, is the best one to use for children in their early 
teens 

The need for an Instrument which will diagnose tendencies to 
make inadequate adjustments is especially needed on the college 
level It IS well known that many students, incapable of making 
satisfactory adjustments, aie in need of guidance in mental hy- 
giene. Laird (17), at Colgate Univeisity, appreciating this prob- 
lem, tried out the Woodworth questionnaire on the college level 
and eventually published his own “Personal Inventory,” a set 
of questions differing in some important particulars from other 
variations of the Woodworth Psychoneurotic Inventory In the 
first place, Laird realized that those answering the questionnaire 
were not simply answering questions, but^ were engaged in a 

* Published by C H Stoelting Company. 
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kind of self-rating Since such judgments of oneself are capable 
of more than a dichotomous division into yes and no answers, he 
arranged that each item of his inventory should be responded to 
on a graphic rating scale Second, he altered the scoiing On the 
basis of the replies of over 2,000 persons a distribution of the 
answers to each question was found The quartile point toward 
the unfavorable end of each distribution was determined, and 
this quartile section was arbitrarily used as signifying that the 
reaction was unfavorable Stencils were so prepared that a count 
could quickly be made of the number of responses that lay in 
the unfavorable area. 

Third, Laird separated his questions into finer groups, called 
(I) Psychasthenoid (thiity-two items), (II) Schizoid (fourteen 
items), (III) Neurasthenoid (twenty-two items), and (IV) Hys- 
teroid (seven items) — total, seventy-five items Group I contains 
questions relating to obsessions, morbid fears, and doubts An 
extreme form represented by those who answer a laige number 
of the questions in group I unfavorably or who have a par- 
ticularly acute form of one or more symptoms is known as 
psychasthenia Group II contains questions relating mainly to 
social adjustment, and inteiest in persons and things The im- 
plication IS that persons who manifest poor adjustments of this 
type indulge excessively m day-dreams or phantasy thinking 
and hence have weakened their contacts with environmental 
realities. This represents an abnormal tendency which m the 
extreme leads to schizophrenia or dementia praecox Group III 
contains questions that refer to physical conditions, especially 
aches, pains, and discomforts, fatigability, and anxiety about the 
health These questions represent a tendency which, if extreme, 
is known as neurasthenia Group IV is composed of questions 
relating to more pronounced physical disabilities such as fainting, 
paralysis, convulsions, loss of memory, etc An extreme form of 
this tendency is known as hysteria. A revised form of the Col- 
gate Personal Inventory omits the questions in Group IV. 

House (15) also, on the basis of experiments with the Wood- 
worth Psychoneurotic Inventory in colleges, presents a revision 
in which the items are chosen for power to differentiate between 
known normal and psychoneurotic subjects In the final form, 
called the “Woodworth-House Mental Hygiene Inventory,” * he 

•Published by C. H. Stoelting Cgmpany, 
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includes one innovation It has long been a hypothesis of those 
who make case studies of mental abnormalities that such mal- 
functioning has Its roots in childhood experiences. The psycho- 
analysts very definitely state that maladjustments in maturity may 
be traced to unfortunate childhood experiences. House, therefore, 
includes twenty-five questions relating to experiences which oc- 
curred before the age of fourteen, as well as fifty questions relating 
to experiences which occurred after the age of fourteen In this 
questionnaire the general heading “This problem has occurred in 
my life” is followed by a list of items each of which is a brief 
statement of a personality adjustment problem Each item is to be 
checked in one of three degrees extreme, moderate, and no. 

J. O. Chassell (6) in collaboration with Goodwin Watson, has 
constructed an “Experience Variables Record” which is an out- 
growth of the Woodworth questionnaire. The innovations in this 
form are (a) an emphasis on the type of situation in which the 
maladjustment occurs rather than on the type of response itself; 

(b) an extension of the rating idea by describing degrees of the 
presence or absence of the variables in the questionnaire; and 

(c) an attempt to obtain data on childhood experiences as well 
as on present irritating situations The record has twelve main 
sections : 


I Mother Relationships 

2. Father Relationships 

3. Relationship with Biothers 
and Sisters 

4. Family Life 

5. Religion and Standaids 

6. Sex Development 

7. Love-Affairs — Crushes and 
Heterosexual Adjustments 


8 Physical Development 

9 Intellectual Development 

10 Vocational Adjustment 

II. Social Situation — Adj'ust- 
ment in Comrade Groups, 
Status m Community, Pub- 
lic Recognition 

12 General Emotional Adjust- 
ment, Happiness, etc. 


These sections, Chassell believes, represent “the maj’or situa- 
tions in our society in which the growing youth finds it necessary 
to make adjustments and develop the more stable attitudes which 
constitute the framework of his character.” 

The questions in each section are divided into four groups as 
follows : 


Group I Environmental situations: general background. 
Group II Environmental factors bearing more directly upon 
the individual subject. . . 
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Group III Subject’s responses, his habits, interests, tendencies 
Group IV Pioblcms of adjustment and more “difficult situa- 
tions” encounteied by later adolescents 

Chassell and Watson, tvc repeat, place most of the emphasis on 
the situation, devoting less attention to the peculiar mode of re- 
sponse that the individual may have developed In the blank they 
provide space after neaily every scale for checking answers for 
childhood, eaily teens, and lecent or present times 

As a sample of the giaded desciiptive scale to be used in an- 
swering each question, the following is taken fiom the Mother 
Relationship Section. 

Her tendency to exhibit physical affection toward subject. 

Very demonstrative, maiked exhibition of affection 
Steady geneial affection — consistent but not maiked 
Physical demonstrations on rare occasions — eve of departure, 
etc. 

Very undemonstrative — subject doesn’t ever lemember sitting 
on her lap. Affection indicated by acts, solicitations, etc Never 
kissed husband or children 


A total score is not computed, since the “Experience Variables 
Record” is intended primarily for the use to be made of the 
answers to specific items. 

The present author (27), acting on the hypothesis that adjust- 
ment IS specific according to the stress of the situation in which a 
person finds himself, bi ought together a series of 175 questions, 
divided into the seven following sections, to form an “Adjustment 
Questionnaire” for use with high school pupils. 

Table 25 

Sections in Symonds Adjustment Questionnaire 
( 27, p 322) 

No of items 


Adjustment in Relation to the Curriculum 24 

Adjustment in Relation to Social Life of the School . 23 

Adjustment m Relation to the Administration 14 

Adjustment in Relation to the Teachers .... 33 

Adjustment in Relation to the Other Pupils 33 

Adjustment m Relation to the Home and Family 36 

Adjustment in Relation to Personal Affairs 12 




Total numbers of items 
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By tabulating the responses to each item made on the fifty 
highest-scoring papers out of a total of 162 and the fifly making 
the lowest scoie, the most significant items were found to be 
(27, P 325). 

Are you given a chance to tell or show what you know in your 
classes^ 

Do you think theie should be more try-out or optional classes? 
Are you required to take subjects that you dislike^ 

Would you select another teacher in any of your subjects if 
you were permitted to^ 

Do your teachers icquiie too much homework^ 

Do your teachers make the assignments too long^ 

Do you like examinations in school'’ 

Do your teachers usually understand your difficulties? 

Do all of your teachers tieat you as a friend^ 

Do any of your teachers have a W'long opinion about you? 

Are you ever punished for things you do not do^ 

Do any of your teachers maik examinations too severely^ 

Do you feel that you are making quite a success of the things 
that you do^ 

Do you often fail in the subjects that you dislike^ 

Do all of your teachers make the assignment clear^ 

Do your teachers praise you when you hand in good work? 

Do you think that any of your teachers are too strict^ 

Aie you doing as much or as well in school as your parents ex- 
pect you to do^ 

Do you think your work this year is rather monotonous? 

Practically all of these items have to do with a pupil’s success 
in school and out, since one may assume that being successful and 
being appreciated by others really do constitute the most im- 
portant factors in determining the quality of adolescent adjust- 
ments 

For the time being, a high point in the development of adjust- 
ment questionnaires has been reached by the “Personality 
Schedule” developed by L L and T G Thurstone (30) This ex- 
tensive inventory of 223 questions is drawn from the work of 
Woodworth, House, Laird, Freyd, and. Allport From the study 
of 694 University of Chicago freshmen the following items were 
determined to be the most differentiating: 

Do you get stage fright? 

Do you have difficulty in starting a conversation with a 
stranger? 
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Do you worry too long over humiliating experiences? 

Do you often feel lonesome, even when you are with other 
people? 

Do you consider youiself a lather ncivous person'* 

Are your feelings easily hurt^ 

Do you keep in the background on social occasions^ 

Do ideas often run through your head so that you cannot sleep ^ 
Are you frequently burdened by a sense of remorse^ 

Do you tvoriy over possible misfortunes^ 

Do your feelings alternate between happiness and sadness with- 
out apparent reason? 

Are you troubled with shyness? 

Do you day-dream frequently^ 

Have you ever had spells of dizziness^ 

Do you get discouraged easily^ 

Do your inteiests change quickly? 

Aie you easily moved to tears'* 

Does It bother you to have people watch you at work even when 
you do It welP 

Can you stand criticism without feeling hurt^ 

Do you have difficulty in making friends ^ 

Are you troubled with the idea that people are watching you 
on the street? 

Does your mind often wander badly so that you lose track of 
what you are doing? 

Have you ever been depressed because of low marks in school? 
Are you touchy on various subjects^ 

Are you often in a state of excitement^ 

Do you frequently feel grouchy? 

Do you feel self-conscious when you recite in class? 

Do you often feel just miserable^ 

Does some particular useless thought keep coming into your 
mind to bother you ^ 

Do you hesitate to volunteer in a class recitation? 

Are you frequently in low spirits'* 

Do you often experience periods of loneliness? 

Do you often feel self-conscious in the presence of superiors^ 
Do you lack self-confidence^ 

Do you find it difficult to speak in public^ 

Do you often feel self-conscious because of your personal ap- 
pearance? 

If you see an accident, are you quick to take an active part in 
giving help? 

Do you feel you must do a thing over several times before 
you leave it? 

Are you troubled with feelings of inferiority? 

Do you often find that you cannot make up your mind until 
the time for action -has passed? 
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Do you have ups and downs m mood without apparent cause? 

Aic you m gencial self-confident about your abilities^ 

Reliability. It is said that Woodwoith obtained a leliability of 
.90 for his original questionnaire of 116 items The present writer 
reports reliabilities of 90 and 84 in the Adjustment Questionnane; 
Thurstone, a reliability of 946 for his Peisonality Schedule. 
Mathews reports 667 between the split halves, and 369 (boys) 
and 697 (girls) on letests Cady lepoits a reliability of 55 on his 
original list and 49 and 47 on the revised list House obtains 
correlations of 714 and 845 on letests Hoitsma, in lepoiting on 
tlie Colgate Mental Hygiene Test, gives reliability coefficients of 
.85 for a repetition of the questionnaire and .79 for the split halves. 
Chassell reports the following coefficients in terms of Pearson C 
for the consistencies in answers between first and second responses 
to separate items in his questionnaire repeated after an interval of 
five to eight weeks 

Table 26 

Reliability CoErnciENTS (Pearson C) tor Chassell Experience 
Variables Record 




7 S 

JS women 

Childhood . 


67+ 

722 

E.irly teens 

• 

641 

719 

Recent or present 


688 

700 

A\ erage 

• • • 

668 

714 


Chassell finds adults as consistent in answers regarding their 
childhood as legarding their present condition One thing seems 
evident from these figures, the consistency of the questionnaire is 
greater for persons of maturity than for children. It seems reason- 
able to expect reliability coefficients of 50 to 60 from children 
twelve to fifteen years of age, of 70 to 80 from college students; 
and above .80 from still more mature individuals. This increase in 
reliability with increasing age may be explained as the growth of 
the ability of individuals to diagnose and judge their own states. 
Children are not used to noticing their own states and adjust- 
ments, and consequently fluctuate and give unreliable answers 
when asked about them Because mature persons usually have 
given more thought to their adjustments, they are in a better 
position to report on them. Another reasdn for the lower re^ 
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liabilities reported for younger subjects is the matter of reading 
difficulty Whcie cliilJieii do not uiidei stand the questions, their 
responses become moie of a chance matter, and this tends to 
attenuate the reliability The psychoneurotic inventory loses its 
usefulness below the high school level 

Validity. The Woodworth Psychoneurotic Inventory was origi- 
nally intended for use in sorting men accoidmg to their ability 
to make satisfactory adjustments under stress It might be called 
a measure of adaptability, meaning by adaptability the capacity 
to adapt oneself m satisfactory ways to unfamiliar and trying 
circumstances That the questionnaire does this is attested to 
by several bits of evidence Woodworth himself says that in a 
group of psychoneurotic patients the average score (unfavorable 
responses) was 36, while the average score for normal individuals 
was 10. Mathews (20) reports correlations of 515 and 663 be- 
tween scores on the test and judgments by teachers of nervous 
stability House (15) finds that there is a distinct difference in 
scores made by normal college students and psychoneurotic pa- 
tients. 

On the hypothesis that many of the behavior problems of chil- 
dren arise from poor adjustments of the type indicated by the 
psychoneurotic inventory, several investigators have used the 
questionnaire as a measure of juvenile delinquency, obtaining re- 
sults which support the surmise Cady (5) reports correlations of 
41, .42, and 36 with teachers’ estimates of incorrigibility, which, 
in view of his rather low reliability coefficients, indicates a pro- 
nounced relationship. Cushing and Ruch (7) report a bisenal r of 
60 on the questionnaire between a group of fifty delinquent girls 
and fifty normal girls Slawson (25) finds that 844 per cent of 
delinquent boys make higher 5 ,cores than the median of normal 
boys as measured by Mathews Bridges and Bridges report that 
delinquent boys make an average of twenty-one unfavorable re- 
sponses on the Mathews revision of the questionnaire against an 
average of nine found by Mathews for normal subjects. 

Miscellaneous findings on the questionnaire help us to interpret 
its significance Mathews reports a decrease in score with age, one 
reasonable explanation of which is that the unreliability of the 
questionnaire for children has a tendency to throw the unfavor- 
able resporises toward 50 per cent, which automatically raises the 
score. Hollmgworth ^’14) found a large drop in the score on the 



Adjustment Questionnaires 187 

questionnaire Immediately after the armistice that ended the 
Woild War, indicating that the tendencies which the questionnaire 
signifies are functional in nature and tend to be aifected by the 
present irritating condition rather than by some past ciicumstance. 
Both Everett (9) and Mathews (20) report higher scores among 
girls than among boys. 

This questionnaire has revealed distinct racial differences One 
study (i) credits Poles with the highest scores, followed in order 
by the French, Czechs, and Americans Another study showed 
negroes making the highest scoies, with Italians, Hebrews, and 
Americans following The interpretation here need not be that 
races have inherited tendencies toward maladjustment, but that 
the questionnaire reflects differences in the environment, general 
conditions of living, racial traditions, etc 

Landis, Gullette, and Jacobsen (19) attempted to determine 
the degree to which the Psychoneuiotic Inventory measures emo- 
tionality The correlation of tlie questionnaire with ratings of 
emotionality was 31, with ratings of emotional stability 21, and 
with ratings of expressiveness — ii. In the same study the corre- 
lation of the questionnaire with speed of tapping was 57 and with 
vocabulary test 44 The questionnaire has apparently only slight 
If any reference to emotionality 

Garrett and Kellogg (12) (reversing earlier findings of Nac- 
carati and Garrett (22)) find no relation between the morphologic 
index and the Psychoneurotic Inventory. 

Mathews (20) found that the questionnaire correlates — .201 
with the IQ for boys and — 055, — 425, and — 592 for girls 
Holtsma (13) reports a correlation of the Colgate Mental Hygiene 
Questionnaire with the Thorndike Intelligence Test for High 
School Graduates of 008, also with scholarship of 074. Thur- 
stone (30) finds a correlation of -)- 037 between the Personality 
Schedule and the American Council Intelligence Test for Uni- 
versity of Chicago freshmen. 

Bridges (2), in a stimulating study using the Woodworth ques- 
tionnaire on college students, concludes that college students show 
poorer adjustment than the general population The women stu- 
dents are less well adjusted than the men students. The typical 
student psychoneurosis is an anxiety neurosis, symptoms of hys- 
teria, psychasthenia, and the major psychoses are rare The most 
frequent symptoms are disturbed sleep, worry, irritability, per- 
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severalion of ideas, and self-consciousncss. There is no correlation 
with intelligence 

Both House (15) and Chassell (6) agree in finding a marked 
relationship between the items answered for the present and those 
answered for childhood House makes his correlations on the basis 
of total score on his inventorv and finds correlations of .705 and 
803 between reported childhood reactions and present reactions. 
Chassell treats his items singly, using Yule’s Q, and finds signifi- 
cant relations between items These relationships should not be 
taken too seriously However much we believe that present mal- 
adjustments are due to habits carrying over from childhood ex- 
periences, the relationships disclosed by these investigators may 
be explained more easily. Subjects answering the questions may 
actually believe that their present troubles began as a consequence 
of childhood experiences and so make their memories testify to 
their belief Or, their memories being weak, they may not be able 
to make very sharp distinctions between present maladjustments 
and previous conditions Answers to questions about childhood 
experiences have dubious value in any case. 

In summary it can be said that the Woodworth Psychoneurotic 
Inventory and its later revisions have been found to indicate 
roughly the degree to which a person is making poor adjustments 
with irritating and difficult conditions of living Various writers 
emphasize the fact that the invcntoiy shows tendencies only and 
that persons with high scores should be subjected to a more 
thorough clinical examination Others believe that a scrutiny of 
the individual items on the questionnaire may be used to make a 
diagnosis of the individual’s peculiar form of maladjustment The 
Woodward questionnaire has also been found to indicate behavior 
difficulties and has been suggested as a test of delinquent ten- 
dencies. 

Further work with this type of questionnaire will undoubtedly 
follow along more special lines. Now that the general usefulness 
of this type of questionnaire has been demonstrated, effort should 
be made to use similar sets of questions to diagnose particular 
kinds of maladjustment In the past psychopathology has limited 
Its classifications to types of responses. But the more significant 
grouping may be around the types of situations that cause mal- 
adjustments Different individuals find various methods of ad- 
justing themselves to the same annoying situations. 
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Pressey X-O Tests 

In 1919 Pressey (46) announced the construction of a new 
type of questionnaire which has subsequently had rather ex- 
tended use In Pressey’s oiiginal statement concerning his test 
he seems more enthusiastic about tlie form of the test than about 
its content At that time he believed that test-makers were con- 
structing their tests so that they were needlessly artificial He hit 
upon the scheme of inserting in a test list irrelevant elements 
which could be crossed out by the person taking the test. This 
could be applied to all forms of testing — intelligence, learning, 
reading, ingenuity, etc , as well as to tests for obtaining responses 
relative to beliefs, interests, and likes and dislikes The feature 
of the test — ciosstng out irrelevant terms — has been symbolized 
in the name X-O tests. 

In the oiiginal “Gioup Scale for Investigating the Emotions,” 
(45) five different “tests” weie included In a revision, completed 
in 1920, theie weie changes in the lists and four different “tests” 
were included One is a “test” m which the subject is instiucted 
to cross out woids whose meaning is unpleasant to him. Four 
types of unpleasantness are recognized in this test and words are 
included to diagnose abnoimal feais, disgusts, abnormal sex ten- 
dencies, and abnoimal self-iegard 

The second “test” consists of words to be crossed out which are 
associated with a key woid at the beginning of the line. This is a 
variation of the free association test 

In a third “test,” words are to be crossed out which represent 
things that the subject considers wrong, making it a kind of ethical 
discrimination test. 

In a fourth “test” words are to be crossed out which refer to 
things about which the subject has ever worried, the material 
being taken laigely from Woodworth’s Psychoneurotic Inventory. 
The woids in each “test” have been carefully selected on the basis 
of experience with psychopathic patients. 

Some difficulty was experienced in using these X-O tests in the 
public schools because of the unwillingness of teachers to use a 
“test” including words relative to sex and sexual conduct Accord- 
ingly Test I containing words whose meaning is unpleasant, and 
Test 2 in which words associated with certain key words are to 
be grossed out were omitted, and a new “teat” known as Form B 
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was substituted in which the subject is instructed to cross out 
everything he likes 01 is interested in All “tests” in this form 
consist of twenty-five lines of material 

The thiee “tests” begin as follows (34, p 304) . 

Test I 

Directions — Read through the twenty-five lists of words given 
just below and cross out everything that you think is wrong — 
everything that you think a person is to be blamed for You may 
cross out as many or as few woids as you like, in some lists you 
may not wish to cross out any words. Just be sure that you cross 
out everything you think is wrong 

I begging, smoking, flirting, spitting, giggling, 

2. fear, anger, suspicion, l azin ess, contempt, 

3 dullness, weakness, ignorance, meakness, stinginess, 

4 fussiness, recklessness, silliness, nagging, fibbing, 

5 extravagance, sportmess, boasting, deformity, talking-back. 

Test 2 

Directions — Read through the twenty-five lists below and cross 
out everything about which you have ever worried, or felt nervous 
or anxious. You may cioss out as many or as few words as you 
like, there may be some lines in which you may not wish to 
cross out any. But be sure you cross out everything about which 
you have ever worried. 

1 loneliness, work, forgetfulness, school, blues; 

2. sin, headache, fault-finding, sneer, depression, 

3. meanness, clothes, sickness, looks, unfairness, 

4. discouragement, self-consciousness, failure, accidents, worry; 

5. temper, disease, pain, money, awkwardness. 

Test 3. 

Directions — Read through the twenty-five lists just below and 
cross out everything you like or are interested in You may cross 
out as many or as few words as you wish, there may be some 
lines in which you will not wish to cross out anything. But be 
sure you cross out everything that you like. 

I. fortune-telling, boating, beaches, mountains, vaudeville; 

2 camping, tennis, hiking, eating, amusement parks, 

3. Beethoven, Edison, Napoleon, Raphael, Tennyson, 

4. kissing, flirting, pretty girls, talkative girls, athletic girls; 

5. studying, dancing, day-dreaming, walking, reading. 
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Three hundred and seventy-five items are included in this form, 
which coveis only thiee pages Tv\eiiLy iiiiiiuLes’ woiking time is 
allowed in the case of college students 

In the original “scale,” Pressey used two methods of scoring, 
one of which he calls the “affectivity” or “emotionality” score; 
the other, the “idiosynciasy” scoie The “affectivit/’ score was 
obtained by merely totaling the number of words ciossed out. In 
determining the “idiosyncrasy” score, the modal word ciossed out 
in each line was first determined (in a group of 114 college stu- 
dents) The “idiosynciasy” score is the number of words crossed 
out which are not the modal word foi each line 
Pressey definitely presents these X -0 tests as research instru- 
ments and not as finished pioducts mcasuiing anything in par- 
ticular. In his original statement he spoke of an intention to 
measuie “emotional instability,” but this original aim was not 
kept foremost in subsequent revisions Pressey says (45, p. 61), 
“The scores on the entiie examination are the blurred result of a 
number of factors, and are of relatively little impoitance How- 
ever, It IS possible, from the mass of data yielded by the ex- 
amination, to combine certain items in such a way as to obtain, 
from the single examination, highly differential information with 
reference to a number of pioblems ” 

McGeoch and Whitely (41) have made an elaborate study of 
the reliability of the Pressey tests, using as subjects sophomores 
in Washington Univeisity Three groups were given the tests with 
repetitions after forty-eight hours, forty-five days, and ninety 
days, respectively. Table 27 on the following page taken from 
their article tells the stoiy 

The reliabilities for the forty-eight-liour interval are fairly satis- 
factory, but reliabilities tend to fall off for the longer periods. This 
IS explained by the mvestigatois as due to an actual change in 
“emotional oigamzation ” 

The general impression which one receives from studying the 
reports of the use of the Pressey X -0 tests is that the tests as 
a whole are a composite of several different kinds of tasks and 
that as a result the total affectivity score or the idiosyncrasy score 
does not measure any one thing with much success Pressey 
frankly states in his announcements that he has assembled a mis- 
cellaneous group of material for experimental study. Another im- 
pression one receives is that such a high premium is placed on 
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Table 27 

Pressey X-0 Tests — ^Reiitbility Coeeficients 
(rrom McGeoch and IthitcK, 41, pp 262-264) 


AFFECTTVITY SCORES 


N 

r Interval 

Test I 

Test 2 

Test 3 

Test 4 

64 

48 hours 

8S 

86 

82 

87 

48 

4S days 

S8 

74 

80 

75 

70 

90 days 

70 

67 

65 

SI 



IDIOSYNCRASY 

SCORES 




Interval 

Test I 

Test 2 

Test 3 

Test 4 


48 hours 

70 

55 

77 

43 


45 days 

S3 

28 

46 

52 


90 days 

46 

43 

45 

53 


CLASSIFICATION SCHEME FOR 

TEST I 


Interval 

Disgust 

Tear 


Sex 

Self-feeling 

48 houra 

88 

82 


90 

77 

45 days 

39 

57 


55 

71 

90 days 

76 

.63 


60 

.70 


CLASSIFICATION SCHEME FOR 

TEST 4 







Hypo- 

Interval 

Paranoia 

Neurotic 

Shiit-m 

Melancholia 

chondri, 

48 hours 

80 

74 

87 

8z 

84 

45 days 

79 

76 

79 

74 

77 

90 days 

S7 

31 

59 

49 

59 


undei Standing the vocabulary of the tests that responses are 
apt to be more oi less chance for dull or even noimal adults 
and for childien. Budges and Bridges (32), for instance, re- 
poit that test i behaves much like an intelligence test Tjaden 
(49) lepoits that the scores on the Piessey test coirelate with 

IQ 

The fact that the “test” bears the title “Group Scale for In- 
vestigating the Emotions” has led many peisons to assume that 
It is a measure of “emotionality” or something similar Landis, 
Gullette, and Jacobsen (39, p 225) in their study in emotionality 
say, “Affectivity fails to give any significant correlations” and “It 
IS hard to see why this idiosyncrasy rating should give other than 
chance correlations with other factors The score here is based 
on the number of times the subject fails to agree with the dislike, 
association, blame, or worry which the modal person of a limited 
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standardizing group gave The idea back of this test — namely, 
that the person who does not agice with the favorite worries or 
blames of his community is odd or peculiar — is coiiect But 
in practice we belie\e it is impossible to standardize the test for 
all groups or communities The woriies, blames, and dislikes which 
an individual possesses aie largely the product of his enviionment 
The city man has one set of beliefs and standards, and the farmer 
another, and so on fiom one group to another. . . We are of the 
opinion that the Pressey test embodies several very worth-while 
ideas, but should have a thorough revision and new standardiza- 
tion ” 

Bridges and Bridges (32) and Lentz (40) have studied the re- 
actions of delinquents to the Pressey X -0 tests All three agree 
that the Piessey tests do not differentiate, so far as score goes, 
between delinquents and non-dchnquents But still it was evident 
that delinquents and non-delinquenls made different responses on 
sepal ate items Bridges and Budges, foi instance, found that de- 
linquents consideied fewer things wrong than college students, and 
likewise that they had more worries and fewer interests. The 
Woodwoith-Mathcws qucstionnaiie is positively coriclated with 
Pressey worries and interests, and negatively with Pressey things 
considered wrong 

Weber and Guilford (50) diaw similar conclusions with regard 
to the use of the Pressey tests with criminals that while they 
show marked diveigence from the norm on separate elements, the 
total scores are not diagnostic of ciiminal tendencies 

Naccarati and Gariett (43) find that the Pressey test has no 
relationship with the moiphologic index 

These negative results should be contrasted with the positive 
findings of Chambeis in using the Piessey tests to measure “emo- 
tional matuiity” (33) and “college achievement” (34) Chambers’ 
plan in both of these investigations was to study the differences 
in responses of groups known to differ markedly in a particular 
quality being studied. In the case of emotional maturity, for 
instance, he studied the responses of a group of 166 pupils in the 
sixth and eighth grades, and another group of 196 pupils in the 
tenth and twelfth grades The responses of each group were re- 
corded in terms of percentage of items marked, and all items in 
which the groups showed a diffeience of 15 per cent or over were 
taken to be significant of emotional maturity. Ninety-four such 
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words constituted a differential unit, and the papers of these 
gioups as well as additional gtoups fiom the fourth grade, the 
sixth grade, and college weie rescoied on the basis of these ninety- 
four Items These gioups showed decicasing scores with increasing 
maturity, a finding wdiich leads Chambers to claim that he has a 
measuie of “emotional” development He suggests that a child 
showing a score of twenty points’ deviation on either side of the 
median for his group may be suspected of emotional maladjust- 
ment However, one would have to know with what else this dif- 
ferential unit con elates before such a sweeping statement could 
be accepted It may be suspected that part of the decrease in 
things considered wrong and things worried about among more 
matuie pupils may be due to the better discernment which comes 
with intellectual development. 

Chambers, using a similar technique, studied the words which 
showed a differentiation in response among college students with 
lecoids of good and poor achievement. The significant words are 
given below (34, p 307) 


Test I (Things Considered Wrong) 


Words marked more often by 
good students 

sportiness 

war 

king 

toughness 

betting 

shame 

nerve 

bribery 

craps 

overeating 


Words marked more often by 
poor students 

fear 

socialism 

day-dreaming 

slowness 

spending 

bashfulness 

absent-mindedness 

pedler 

union 


Test II (Things Worried About) 


Words marked more often by 
good students 

books 

self-consciousness 

accidents 

rivals 

parties 


Words marked more often by 
poor students 

work 
failure 
police 
wrecks 
' dreams 
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Test III (Things Liked) 


Words maiked more often by 
good students 

mountains 

leadeis 

camping 

Napoleon 

day-dreaming 

reading 

teaching 

Mowgli 

D’Artagnan 

teachers 

books 


Words walked more often by 
poor students 

dancing 
smoking 
rough boys 
rich boys 
society 
bargains 


Chambers reports correlations as follows for fifty-seven cases 
( 34 . PP- 308, 309) : 

Table 28 

CoRBELATIOKS BETWEEN PrCSSEY X -0 TeST AMD INTELLIGENCE AND 


School Marks 
(from Chambers) 

r grades — ^X-0 46 

r grades — intelligence .... S3 

r X- 0 — intelligence 51 

r grades — X- 0 , Part I 16 

r grades — X- 0 , Part II 00 

r grades — ^X- 0 , Part III 42 


We may conclude from the array of evidence that the Pressey 
X-0 test contains material which may be of service m diagnosing 
various conduct trends But exactly which elements these aie can 
only be discovered by the type of analysis used by Chambers 
Total scores on the Pressey X-0 tests are such a complex of vari- 
ous sorts of reactions as to be indicative of nothing that can be 
simply and usefully designated. 


Questionnaires to Measure Introversion-Extroversion 

\ 

The questionnaire has also been found to be a convenient in- 
strument for measuring the differences between individuals in 
respect to introversion-extroversion. In our discussion of the 
Woodworth Psychoneurotic Inventory, it was noted that whereas 
it was easy to describe certain outstanding sttuattons that are 
baffling and vexing and that cause maladjustment, it was less 
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satisfactoiy to test the tendency to poor adjustments by asking 
subjects about the } espouses themselves, because dilTerent indi- 
viduals have such vaiied methods of making these emergency 
adjustments Now it can be added, howc\ci, that these icsponses 
can all be examined with respect to one particular feature of 
adjustment Some individuals, who tend to turn themselves out- 
ward in their attempts at adjustment and to make vigorous even 
though ineffectual icsponses, aie chaiactenzed by an abundance 
of energy and zeal and lestlessness quite out of proportion to 
the lesulis that then cfToits bring Other individuals, turning 
inward and tending to withdraw their contacts fiom the outside 
world, attempt to find their satisfactions m their owm leibal and 
implicit reactions The tendency to make one or the other of these 
types of adjustments may be discovcied by certain well-placed 
questions The distinction between these tw'o types of adjustment 
in annoying situations was first elaborated by Jung m his Psycho- 
logical Types. 

Laud (67), Maiston (70), and Heidbreder (63) have severally 
but so nearly simultaneously published accounts of their efforts to 
measure intioversion and extroversion that it is difficult to assign 
the Cl edit of being first to any one of them Pei haps all these 
attempts are really derivatives of the Woodwoith Inventory, 
because certain of the questions asked in the introveision-extro- 
veision questionnaires were also used by Woodw'orth in his in- 
ventory Woodwoith, for instance, asked, “Do you make friends 
easily’” and Laud uses a similar question 

Laird (67) in the oiigmal Colgate Personal Inventory, had a 
section C-i of flfty-thiee questions for measuring introversion- 
extroversion The technique of the graphic rating scale, aheady 
described on page 158 for section B-i of the Colgate Personal 
Inventory, w'as used Laud, finding that the introversion-extio- 
version division of the Personal Inventory was the most valuable 
section, subsequently subjected it to thorough revision Out of 
over TOO questions alleged to be indicative of introversion-extro- 
version, Laird and his workers selected forty-one items as valid. 
These have been incorporated m revised forms of the Personal 
Inventory known as C-2, in which the student rates himself, and 
C-3, in which the student is rated by instructors or by friends 

A slight change has been made in the form of the scale in 
C-2 and C-3 Instead of using an unbroken line for the graphic 
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rating, the line has been biokcn into ten segments The scoring 
key indicaLes how many segments are included in the area which 
may be definitely called intiovcision on the basis of a method 
similar to that employed in the fust edition The upper quartile 
was found from the distiibution of 1 espouses for each item ob- 
tained from individuals who rated themselves on the “Inven- 
tory^,” and the critical segment for the scale of each question to be 
included in the scoring key was that which corresponded to the 
quaiter of most unfavorable answers Laird states that this dich- 
otomous method of scoiing correlates 88 with the score obtained 
by assigning ct edits according to the fifth of each scale in which 
the rating falls He piefeis this new method both because of 
Its simplicity and because he is really interested in spotting indi- 
viduals with high introversion tendencies Scoies might also be 
obtained by detci mining whcthei an individual’s rating lies be- 
yond a critical point on the extrovcision end of each question 
scale. Land states that such extroversion scores correlate —.92 
with the introvcision scores 

Heidbrcder (63) took Freyd’s collection of fifty-four specific 
characteristics of introveision and worked them up into a ques- 
tionnaire Each Item was to be rated by a -t- if the characteiistic 
was one which an individual possessed, by a — if the characteris- 
tic did not apply, and by a ^ if the individual was neutral with 
respect to the chaiacteiistic In her investigation. Miss Heid- 
breder not only had students answer the questions with regard 
to themselves, but had each student sccuie ratings on himself by 
two of his fiiends Each item was studied with respect to its rela- 
tion to the total, and it was found that there was a high consistency 
between items In the list which follows, the first thirty-one 
Items are diagnostic both according to self ratings and associates’ 
ratings, while the last si.x, although not diagnostic statistically, 
show a diiection tendency in harmony with the list as a whole 
The Items are in order of diagnostic value (63, pp. 129-13 1). 

Characteristics of 
Introversion 

I. Limits his acquaintances to a select few. 

2 Feels hurt readily; apparently sensitive about remarks or 
actions which have reference to himself. 

3 Is suspicious of the motives of others. * ^ _ 



198 Diagnosing Personality and Conduct 

4. Worries over possible misfortunes. 

5 Indulges m self-pity when things go wrong. 

6 Gets rattled easily, loses his head in excitement or moments 
of stress 

7. Keeps in the background on social occasions, avoids leader- 
ship at social affaiis and entertainments. 

8 Is critical of others 

9. Prefers to woik alone rather than with people, prefers to 
work at tasks that do not bring him into contact with people. 

10. Has ups and downs in mood without apparent cause. 

11 Is meticulous, is extiemely neat about his dress and pains- 
taking about ins personal property. 

12 Blushes frequently, is self-conscious 

13 Pays serious attention to rumors. 

14 Expresses himself better in writing than in speech. 

15 Resists discipline and orders 

16 Limits his acquaintances to members of his own sex 

17 Avoids all occasions for talking before crowds Finds it diffi- 
cult to express himself 

18 Is a radical, wants to change the woild instead of adjust- 
ing himself to it 

19 Is outspoken, says what he considers the truth regardless of 
how others may take it 

20 Introspects, turns his attention inward toward himself. 

21 Prefers participation in competitive intellectual amusements 
to athletic games 

22 Is strongly motivated by praise. 

23. Day-dreams 

24 Is selfish 

25 Dislikes and avoids any process of selling or persuading 
any one to adopt a certain point of view (except in the re- 
ligious field) 

26 Is sentimental 

27 Prefers to read a thing rather than experience it. 

28 Is extremely careful about the friends he makes, must know 
a person pretty thoroughly befoie he calls him a friend. 

29 Shrinks from actions which demand initiative and nerve. 

30 Prefers to work things out on his own hook, hesitates to 
accept or give aid. 

31 Talks to himself. 

32 Derives enjoyment from writing about himself. 

33 Keeps a diary. 

34 Shrinks when facing a crisis. 

35 If he unburdens at all, he does so only to close personal 
friends and relatives 

36 Is reticent and retiring; does not talk spontaneously 

37 Is creative of new and sometimes eccentric ideas and things. 

38 Works by fits and starts. 
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39. Is a poor loser; considerably upset and indisposed after the 
loss of a competitive game 

40 Depreciates his own abilities, but assumes an outward air 
of conceit 

41. Is absent-minded 

42. Hesitates in making decisions on oidinary questions in the 
course of the day. 

43 ’Believes m “mind cuies”, accepts an idealistic philosophy. 
44. Has ups and downs in mood with apparent cause 

45 Rewrites his social letters before mailing them. 

46 Is slow in movement 

47. Is governed by reason rather than Impulse or emotion Is a 
good rationalizer, le, can give good reasons for his actions. 

48 Admires peifection of form in literature 

49 Makes mistakes in judging the character and ability of 
others 

50 Is thrifty and caieful about making loans. 

51. Is effeminate (if a man) 

52 Is persistent m his beliefs and attitudes. 

53 Takes up wotk which requires painstaking and delicate 
manipulation 

S4. Is conscientious. 

Marston (70) studied introversion-extroversion in young chil- 
dren two to SIX years of age His questionnaire, of necessity an- 
swered by parents or teachers instead of by the children them- 
selves, consists of twenty items, each made up of two statements 
desciibing the opposite poles of a characteristic which is thought 
to be significant of the introversion-extroversion tendency. Either 
of the two statements is to be maiked with two plus signs if 
the description of the chaiacteiistic definitely describes a child, 
and with one plus sign if the child merely inclines in that direc- 
tion Marston classifies the items of his scale as follows : 

I 

No of Items 

Social or Sclf-Attitudcs lo 

Energy Qualities 7 

Emotional Tendencies . . 3 

Guthrie (6i) mentions other methods of measuring introver- 
sion-extroversion One is a test of campus information or gossip, 
by which the degree to which “students were in touch with their 
human environment or were recluses” might be determined 
Naturally such a test has a rather marked correlation with in- 
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tclligence A second method is the free-association test A third 
method for measuimg extroversion is the degree to which a stu- 
dent appioximatcs the composite judgment of his gioup in rank- 
ing his instiuctois in oidei of effectiveness as teacheis 

Conklin (55) has also devised an inteiest questionnaire de- 
signed to test introvci t-extrovert differences Forty activities are 
listed, such as to play baseball, to hear a lecture on classical 
music, to talk with fi lends about hunting, to visit an automobile 
show, to talk uilh fiicnds about literatuie, to read essays on 
liteiaiy criticism, etc Twenty of these have been statistically 
determined to be significant of mtioveit tendencies, and twenty 
of extioveit tendencies Each of these is to be lated on a nine- 
point scale The test is scored by obtaining the ratio of the sum 
of the reactions to the cxtroveit items to the sum of the reactions 
to the introvert items 

Rlaiston (70) also studied some peiformance tests of intro- 
vei sion-extroversion 

One important fact which stands out from an inspection of the 
distiibution of scoies on an intioversion-extroversion question- 
naire Is illustrated by the following representative distribution 
taken from Miss HeidbiedeEs study. 


Table 29 

Frequency Distribution of Scores on Introversion-Extroversion 
Questionnaire 

Positive scores indicate a tendency toward introversion 
(from Heidbreder) 


Class mterval 

/ 

+ 20 

+ 24 

I 

+ IS 

+ 19 

7 

+ 10 

+ 14 

14 

+ s 

+ 9 

31 

0 

+ 4 

6z 

— I 

— S 

70 

- 6 

— 10 

108 

— II 

— IS 

8 S 

— 16 

— 20 

106 

— 21 

— 2 S ' 

66 

— 26 

— 30 ' 

35 

— 31 

— 3 S 

' 9 

— 36 

— 40 

S 

— 41 

— 4 S 

I 


600 
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People do not group themselves into the two extreme types 
The distribution of individuals in mtioversion-extioversion fol- 
lows the noimal law of frequency Few people are extiemely 
mtiovert, few are extremely extrovert, most persons occupy an 
intei mediate position, and for such people the term ambivert has 
been coined 

A second important fact is that a randomly chosen group 
of persons tend to answer more questions as extrovert 
than as introvert To be introvert is more unusual than to 
be extroveit, and could probably be classed also as moie ab- 
noi mal 

In the third place, individuals tend to rate themselves as being 
more introvert than their friends rate them 

Reliability. Hoitsma (64) reports a correlation of 674 for a 
repetition of the Colgate Personal Intentoiy C-i and 45 for one 
half against the other half The correlation of introversion scores 
vs extroversion scotes was — 36, — 22, and — 45 On the re- 
vised form C-3, Laud (68) reports a correlation of 85 between 
the rating of two associates on a third person and 90 for the rat- 
ings of a person on himself at different times Conklin (55) re- 
poits a reliability of 72 for the Colgate questionnaire, and 
Guthrie (61) lepoits a similai correlation of 60 It seems prob- 
able that Laud loses more reliability than he believes by his 
dichotomous method of scoiing Heidbredcr reports correlations 
of 55 between self ratings and associates’ ratings and 40 between 
associates’ ratings Maiston reports an aveiage con elation of 89 
for the two halves of his scale and an average correlation of 71 
between the ratings of two judges Conklin reports reliability 
coefficients of 95 and 92 foi split halves of his questions, coi rected 
by the Spearman-Brown foimula 

The evidence leportcd by Heidbredcr of the high relationship 
between the answers to separate questions and the total score 
deserves mention here 

The conclusion is that the instruments for measuring intro- 
version-extroversion have very high internal consistency, but suf- 
fer from the unreliability of the ratings Research to date indi- 
cates that something real is being measured by the questionnaires 
and that success in the use of the instruments is hampered only 
by the difficulty which subjects and associates have in answering 
the questions correctly and without bi^s, * ^ - 
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Validity. The mam line of evidence that introversion-extro' 
version questionnaires aie measuiing what they claim to be is 
afforded by internal consistency of the items Heidbreder (63), 
Conklin (55), and piobably Land have checked each item against 
the whole list and find a significant relationship 

Hoitsma (6^) and Conklin (55) report a negligible correlation 
between their own instruments and intelligence on the college 
level Guthiie also imports a correlation of 01 between the Col- 
gate Personal Inventory C-3 and intelligence. Conklin claims a 
correlation of 2 1 between the Colgate Personal Inventory and 
the “American Council Intelligence Test,” which, as it is based on 
only fifty-one cases, may piobably safely be dismissed as a chance 
deviation fiom zero The introversion-extroversion questionnaires, 
then, show negligible correlation with intelligence. 

Hoitsma repoits a correlation of + 35 between the Colgate Per- 
sonal Inventory and college scholarship Guthrie (61) finds the 
same variables to correlate only ii at the University of Wash- 
ington. This IS an issue of extreme importance which should be 
cleared up by further expeiimental work. 

Several investigators find differences between occupational 
groups with respect to mtroversion-extioversion as measured by 
a questionnaire Laird (68) reports that foiemen and executives 
arc more extrovert than introvert, while inspectois, accountants, 
and research engineers are more introvert than extrovert. Nurses 
aie extrovert as a group, Laird reports, while Pechstein finds that 
teachers are introvert. In this connection Pechstein writes (71, 
p. 196): 

“General tendencies were noted for the student teachers to be 
more introverted than sophomores, and for the teachers to be 
more introverted than the student teachers. Other comparisons 
within these major groups revealed that the older teachers tended 
to be more introverted than the younger teachers, the teachers 
with college degrees to be more introverted than the teachers 
without degrees, and the brighter teachers (as measured by group 
mental tests) to be more introverted than the duller teachers. 
. . Lack of social adjustment and adaptability is especially un- 
desirable in teachers, yet this study indicates a selective process 
whereby the more introverted women tend to get into teaching 
and to stay in it longer. 

“All the subjects were unmarried The hypothesis is suggested 
that introversion in a woman means lesser likelihood of marriage. 
The prospect of marriage keeps out of the profession many who 
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might enter, and the call of marriage draws from the profession 
the less introverted members ” 

Conklin repoits the following facts for professional groups who 
answered his introversion-extroversion questionnaire 


Table 30 

Group Differences in Extroversion-Introversion Interest Ratios 
(from Conklin, SS, p 34.) 


Student group {above freshman year) 

No 

Av 

SD 

SD 

Journalism majors (female) 

20 

1175 

404 

90 

Journalism majors (male) 

IS 

1009 

35 0 

90 

English lit majors (female) 

22 

971 

241 

S I 

Physical educ majors (female) 

13 

764 

16 2 

4S 

Business adm majors (male) 

40 

66 9 

173 

27 

Pre-medical majors (male) 

32 

772 

21 8 

38 

Pre-law majors (male) 

iS 

79 9 

20 4 

48 


Non-student groups 

No 

Av 

SD 

SD 

Life insurance salesmen 

21 

664 

165 

36 

Bank employees (male) 

112 

781 

28 2 

27 

Bank employees (female) 

, 23 

946 

243 

SI 


In spite of a more recent statement from Laird’s laboratory 
that there is no relation between introversion and vocational 
choice, It seems to us plain that the mtroversion-extroveision 
characteristic has some significance as a factor in vocational group 
differences Devices for measuimg it deserve consideration as 
techniques of vocational guidance for mature persons, and mean- 
while of course they must also be subjected to further investiga- 
tion If the results already reported are substantiated, we have 
m such questionnaires the promise of effective instiuments for 
advising persons as to the type of woik for which they are best 
fitted and in which they will be most contented. These question- 
naires deserve also to find a place in educational guidance As 
Conklin points out, taken with an intelligence test, they should 
assist pupils in choosing curricula wheiein he their greatest hopes 
for making successful adj'ustments. 

Hoitsma reports a correlation of .49 between form B and C-i 
of the Colgate Mental Hygiene Test. Schwegler (73) reports a 
similar correlation of .50. This substantiates the claim of mental 
hygienists that psychoneurotic persons tend to make introvert 
adjustments. 
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Laird (68) claims a correlation of 6o between extroversion- 
intioveision and the morphologic index, a relationship which, al- 
though higher than a more extensive investigation would reveal, 
indicates that possibly the tendency to make one or another type 
of adjustment is organic and is conditioned by the balance of 
glandulai secietion 

Laird (68) makes the statement tliat women are more introvert 
than men, while Marston finds girls moie inti overt than boys 
Heidbiedei (62) could discover little diffeience between men and 
women in total scoie on her questionnaiie, but reports that the 
two sexes make dificicnt responses on separate items. Certain 
items aie more diagnostic of introversion for men than for 
women (62, p 58). 

“It will be observed, too, that most of the traits which are more 
characteristic of women — and which aie less fiequent among men 
— are those which would mterfeie with efficient woik, while those 
w'hich aie chaiacteristic of men — and which occur less frequently 
in women — aie those wdiich would keep an individual from being 
socially agiecable” , . “It may not be amiss to observe that 
theie IS evidence in the psychological liteiatuic that character and 
personality traits aie susceptible to training, and this evidence, to- 
gether with the fact that the sexes aie not expected to do the 
same kinds of work in the w’oild, and that different modes of 
behavioi arc considered appiopiiate for men and women, makes 
It possible to explain the dilieicnces that appeal without assum- 
ing any fundamental, natice sex differences in temperament” 

Miss Heidbredcr concludes that “sex differences and introver- 
sion-cxtroveision diffeienccs act as independent variables.” 

Losing the Marston Scale, Caldwell and Wellman (53) deter- 
mined that extroversion characterizes certain types of leadership. 

“Extroveision among the girls was most marked in the Science- 
Club chan man, student-council members, and magazine-staff mem- 
bcis In all types of Icadeiship except athletics the girls w'eie 
ranked as extroveits The athletic leaders weie ranked at balance 
between extroversion and introveision The boys tended to be 
more extrovertive than mtrovertive, but not to such a marked 
degree as the girls. The two magazine-staff representatives were 
notable exceptions, ranking as decided introverts.” 

Schwcgler (73) gave an extensive battery of tests to high school 
students chosen for being definitely introvert or extrovert on the 
showing of a questionnaire resembling that devised by Marston. 
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The bisenal r expressing the degiec to which these two contrasting 
gioups differ is as follows 

T\BLr 31 

Ki->iRi\L ! lMR0\rn'.i0N-E\iK0\LKS'0\ DirrrRCNcr A^D Wriols 

Factors 

A positive coefficient indicates a higher distribution of scores by introverts 


(from Schnegler) 

Age . — 031 

McCall-Multimcntal T-score — 339 

Free-Association Test — mean time 620 

90 per cent time 494. 

response failure 198 

rcpioduction failure . — 037 

word range 107 

contiast responses . — 406 

Ink blot — nouns . . — 386 

veibs . Ill 

Motor output — unspeeded . — 250 

speeded — 328 

total . . . . — 309 

kinetic rcscive . 054 

Multiple choice . . . 257 

Wcight-sclcction . . ... OS4 

suggestibility — 216 

Motivated choice time . 159 

Picture suggestibility — 061 

Color-naming — first trial , 148 

second trial , 108 

total 145 

Trend (psychoneurotic) questionnaire . 496 

Conditioned affeet (Travis) . . — 313 


Schwcglei chaiacteiizes an introvert as being 

“in the picscnce of single familiar uncomplicated situations 

(a) slow Cl in tcibal response 

(b) less ficely productive of words, ideas, and movements 

(c) slightly moie tenacious in holding to the evidence of his 

own experience 

(d) less given to superficial automatized responses 

(c) moie inclined to moibid anxieties, to autistic tiends, and 
to psychasthenia, obsessions, and phobias, and 
(f) less inclined to admit the presence of a rich emotional life 
than IS the contrasted extrovert.” 

Ascendance-Submission 

The tendency to dominate or submit to otheis is another 
.characteiisLic which may yield to attack* by the questionnaire 
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self-rating method. A prlo)i it appears that any individual has 
enough consistency in his behavior in various situations with re- 
gard to his dominance or shyness to make it worth while to at- 
tempt to measure it G W Allport, together with F H Allport, 
studied the pioblem, assembled a series of situations in which 
persons exhibit ascendant oi submissive behavior, and finally con- 
structed questions by means of which a person could rate himself 
on this trait A sample of the questions follow (78, p. 125) 

“At church, a lecture, or an enteitainment, if you arrive after 
the progiam has commenced and find that theie are people 
standing but also that theie are front seats available which might 
be secured without ‘pisrgishness’ or discourtesy but with con- 
sideiable conspicuousness, do you take the seats? 

habitually 

occasionally 

never ” 

Two forms of the questionnaire were constructed — one for men 
and one for women Many of the questions bear close resemblance 
to questions found in introversion-extroversion questionnaires, in- 
dicating that this may be another expressive phase of some under- 
lying constitutional difference 

The derivation of the scoiing scheme is interesting and instruc- 
tive For all students who took the original test, ratings were 
also obtained (one by self and four by classmates) on ascendance- 
submission These five ratings were then averaged for each indi- 
vidual to yield a criterion An average of these criterion ratings 
Ivas then obtained for each response made on the test. 

T\ble 32 

E\ALUA'noN or Stnclf Item o^ Aliport AscrNDANCE-SuDMissioN Ques- 
tionnaire Showing ])iRnATioN or Scoring Scheme 



(from Aliport, 78, p 

iz8) 




Average of 



Average 

all criterion 


Choice 

criterion 

scores 

Diference 

Habitually . 

3 3S 

348 

+ 13 

Occasionally 

3 SO 

348 

— 02 

Never . 

3 S7 

348 

— 09 


The average of all criterion ratings was 3.48 The difference 
between those ootained for any response and the average of all 
taken to one decimal place, was used as the scoring value of the 
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item No attempt was made to determine whether the differences 
in average criteiion values foi diffeient responses uere statisti- 
cally significant, but since larger dilTercnccs icsullcd in larger 
scoring values for a response, this matter t\as soincuhat auto- 
matically cared for. 

The reliability of the questionnaire was found to be 78 for 
women who lepeatcd the questionnaire, and 58 (collected to .74 
by the Spearman-Brown formula) for men by the split-half 
method 

The questionnaire results were correlated against the ratings 
as a measure of validity For the composite ratings this correla- 
tion was 586 and 496, for self-ratings alone 633, and for asso- 
ciates’ ratings 459 But these coi relations are spuriously high, 
since they were determined on the same groups that helped de- 
termine the scoring key. On ficsh gioups correlations of 29, .30, 
and .33 were obtained 

The authors of the test point out its possible usefulness m 
personal counseling, educational and vocational guidance, and vo- 
cational placement The suggestion is made that possibly the 
questionnaire measuics one factor in leadership. 

The question as to whethei or not the test aitificially creates by 
Its very name and scoie a distinction which has no leal existence 
must be constantly boine in mind It is quite possible that the 
tendency to dominate or submit is specific foi each of the myriad 
social situations in which a pcison finds himself, and that the re- 
sponses may actually bear little leseinblance to one another How- 
ever, the very reliability of the questionnaire indicates an internal 
consistency of the items which is significant It may be that this 
consistency simply leflects the consistency wuth which the person 
considers himself, as indicated by his leplies to the questions, but 
even if this should be all its means, this fact would of itself be 
significant How'cvcr, the correlation wuth the ratings indicates 
that It corresponds to actual behavior chaiacteristics which others 
can also observe. 

The Wish 

The wish as a diagnostic instrument possesses considerable 
significance, as is shown by preliminary studies by Washburne 
By merely giving children an opportunity to state three wishes 
in various ways, Washburne (80) is able tb make significant in- 
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terpretations conceining adjustment This practically untouched 
method is deseiving of fuilhei tiial and experimentation 

An integration of these vaiious questionnaiies designed to meas- 
ure adjustment has been effected in a “Personality Inventory” 
constiuctcd by Bernieutei ■* This qiicslionnaiie of 125 questions 
to be ansueied by cnciicling one of the thicc icsponses yes, no 
and ? IS scoied in foui nays to yield (i) a measuie of neurotic 
tendency, (2) a measuie of seK-siifTicicncy, (3) a measure of 
introveision-extioveision, (4) a measuie of dominance-submission 
Reliability coclhcientb are leported for college psychology classes 
all of uhich are ovei 85 The vaiious scores have been validated 
by correlating them against (a) the Thurstone Neurotic Inventory, 
(b) the Bernieuter Self-Sufficiency Test, (c) the Laird C-2 Intro- 
version Test, (d) the Allpoit Ascendance-Submission Reaction 
Study The coefficients of coi relation, when coirected for attenua- 
tion, are all over 90 (with one exception) and most of them very 
close to I 00 An interesting set of intercorrelations indicates that 
the scale of neurotic tendencies and the scale of introversion- 
extroversion are closely related. 

Table 33 

Intercorrelations between Measures of Neurotic Tendency, Self-Suffi- 
ciency, Introversion-Extroversion, and Ascendance-Submission 
ON the Bernreuter Personality Inventory 

(from Bernreuter) 

Self- 

Neurotic sufficiency 

Neurotic tendency — 39 

Self-sufficiency 
Introvers lon-extro version 

Summary 

The questionnaire has been found useful as a measure of mal- 
adjustment The psychoneurotic inventory, as devised by Wood- 
woith, has proved its value as a measure of adaptability. It 
differentiates between normal and neurotic subjects, and between 
delinquents and non-delinquents It has demonstrated its value 
in determining the degree of adjustment of school children with 
different phases of their environment. Questionnaires have also 

* Published by the Stanlird University Press, 1931. 


Introversion- Ascendance 
extroversion submission 


93 
— 28 


— 82 

50 

— 73 
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been constiucted which measuie tendencies toward introversion- 
extioveision, ascendancv-submission, and other fundamentnl pei- 
sonality difTcienccs The wish is anotlier foim of lesponse which 
possesses consideiable significance foi adjustment These question- 
naiies piomise to be of seivice in vocational placement, in study- 
ing the adequacy of educational proceduies, and in determining 
by survey methods those individuals whose maladjustment with 
their sill roundings is seiious enough to warrant further individual 
counsel and advice 
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Chapter VI 

ATTITUDE QUESTIONNAIRES 

P ROLIFIC use of the questionnaire has been made to ex 
plore and to tap attitudes on various social and ethical 
issues. Used m this way, the questionnaire takes the form 
of a rather lengthy ballot, and indeed it is so called in many of 
the studies In describing the work done with this form, we shall 
make no attempt to present the studies m chionological order and 
no pretense to completeness. Several studies came out at about 
the same time, and much of the work has never been published. 
Our aim will be to describe those studies that offer the best sug-- 
gestions for methodology and which yield critical data. 

The present author constructed a sample ballot of 115 items on 
current social issues These items were in the form of questions, 
each of which was to be answered by yes or no Samples are (20, 
p. 316). 

I. Is It desirable that schools be permitted which are conducted 
in a foieign language? 

9. Should automobile drivers be given licenses without exam- 
ination? 

34 Should society deny any man the right to work? 

59 Should the feeble-minded be educated? 

85 Should the city maintain playgrounds? 

100. Should an accurate rccoid of births, marriages, and deaths 
be kept by a public agency? 

This questionnaire was submitted to five persons, a sociologist, 
an English professor, two psychologists, and the writer, who were 
asked to answer each question according to what they believed 
to be the liberal, progressive, or radical position. Liberal, pro- 
gressive, and radical were not defined except as being the oppo- 
site of conservative. It is not thought that these three terms are 
synonymous, but that they contain in common a point of view 
which would tend to produce a certain tyfle of answer to each 
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question. Questions which the judges answered with three y^r’s 
and two no’s or tno yes’s and thice no’s were thrown out, leaving 
1 15 for which the issue was considered definite After being 
woided so that there was an equal numbei of yes and no liberal 
answers, the 115 weie then placed in random order It is not 
contended that theie is a light or wrong answer to these ques- 
tions, but for a key the answeis which weie given as liberal, 
radical, or progressive weie used An impressionistic answer 
lather than a reasoned-out answer was desired from those taking 
the questionnaire The score is the percentage of liberal answers 
out of questions tried 

Moore (12) used a similar twenty-item questionnaire on so- 
cial issues, each item a question to be answered by yes or no 
No statement is given as to how the “key” for this question- 
naiie was constructed except that it was based on the judgment 
of Moore, who studied the questionnaire, and of Rice, who con- 
stiucted It. 

Watson (30) in his “Survey of Public Opinion on Some Re- 
ligious and Economic Issues,” described in The Measuiement 
of Fair-mindedness, has pioduced types of attitude measurement 
that will stimulate further pioductive work m this field for 
yeais to come His is a set of tests which deserves careful study 
and analysis As a total test the Watson survey may be used 
to determine the dcgiee to nhich an individual tends to lean 
toward extieme opinions in either the radical or conservative 
direction It measures the tendency to be extreme or biased or 
prejudiced m one’s attitude or opinion toward social issues as 
opposed to being impaitial or unprejudiced or “fair-minded” — 
hence the name “A Measure of Fair-mindedness ” Besides this, 
the test makes possible a determination of the strength of preju- 
dice in twelve different economic or leligious directions, as shown 
m the following list. 

1. Economic radicals. 

2. Economic liberals. 

3 Economic capitalists. 

4. Persons fighting for a “social gospel,” rather than an indi- 
vidual interpretation. 

5 Persons interested mainly in a “personal gospel,” prayer, 
mysticism, communion, salvation, etc 

6 Fundamentalists, orthodox “Apostles’ Creed” variety. 

7. Modernists, holaing liberal Christian vie^irs. 
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8 Religious radicals, very bioad, displeased with most exist- 
ing Christian manifestations of icligion 

9 Piotestants \tho aie inclined not to like Catholics 

10 Catholics nho aie inclined not to like Piotcstants 

11 Pci sons with high, strict standaids of se\-cthiCB, 01 amuse- 
ment, or “bad habits,” or similai moial matters 

12 Persons with broad, loose standards of scx-ethics, or amuse- 
ment, or “bad habits,” or similar moral matters. 

For purposes of description let us narrow our scrutiny to a 
point where we will consider the “Survey” merely as a measure 
of “economic radicalism,” choosing all illustrations to represent 
this particular issue 

In Foim A (test 1), “Cross-Out Test,” a list of words is given 
The directions aie to cross out the woids which suggest moie 
that IS disagreeable than that is agreeable Examples of such 
words are 

Big Interests, Capitalist, Ku Klux Klan, Wall Street, Landlord 

The idea for this test is taken from the Pressey Cross-Out 
Tests. The words given as samples leprcscnt to the average man 
some of the most irntating elements in economic life It is 
assumed that if a person crosses out any of these items he tends 
to oppose the established economic oider It presumably indi- 
cates a tendency to favor a change from the present state of 
economic affairs. 

In Foim B (test 2), “Degree of Tiuth Test,” and in Form F 
(test 6), “Geneializations Test,” statements are given which 
the subject is to approve or disapprove In Form B the direc- 
tions state that he is to indicate the degiee of truth of the state- 
ment If the response weie merely to indicate whether the state- 
ment IS tiue or false, this would become a true-false test, and 
as such It would not differ maikedly in piinciple fiom the ques- 
tionnaires by Symonds and Moore descubed above In this par- 
ticular instance, however, since Watson wishes his subjects to 
estimate degrees of truth or falsity, he puts before each ques- 
tion the symbols -\-z -f-i o — l — 2 one of which is to be 
encircled to designate that the statement is utterly and unquali- 
fiedly true, probably tiue, or tiue in large degree, an unchecked , 
open question, ptobably false, or false in large degree; utterly 
and unqualifiedly false. Examples are. « 
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-{-2 +i o — I — 2 The churches are more sympathetic with 
capital than with labor 

-\-2 +10 — I — 2 To have expeiienced business men, who 
have made a financial success in private 
entcipiise, hold the public offices of the 
count! y would be better than the present 
personnel 

+2 +I O -I -2 Unless industrial and economic condi- 
tions in the United States are remedied by 
sweeping changes in the present capitalistic 
system, we shall have a class revolution 

In Form F, “Generalization Test,” similar statements are 
gi\cn In these items a statement is made concerning some group 
of peisons, or events or msUtutxons , and the subject is to indicate 
how general these statements are The words All Most Many 
Few No placed before each statement offer a choice of under- 
hnings to indicate the degiee to which the statement is believed 
to be true In many psychological respects this test is similar to 
the preceding — it asks for a judgment on a statement But 
wheieas the former asks for the degiee of conviction for or against 
the statement, the piesent asks for an estimate of the degree to 
which the statement can be said to be true Those who think 
statistically will see that a man’s belief of the degree of truth of 
a statement, if based on reasoning, would depend upon his esti- 
mate of the probabilities in the situation, and this leads back to 
the generalization test However, opinions are dctei mined by so 
many other factois than reasoning that the iclationship between 
the two tests cannot be pushed loo far Examples are. 

All Most Many Few No Men of prominence in business to- 
day woikcd their way up from humble 
beginnings without money or influen- 
tial fiiends to help them 

All Most Many Few No Socialists are anxious to take away 

the money from the rich so they can 
have more for themselves. 

All Most Many Few No Poor men win important lawsuits 

against great corporations. 

The other three tests, instead of getting at opinion directly, do 
Vo indirectly by testing the reasoning that one does in connection 
with the issue Two of the tests. Form C and Form E, are 
based on the assumption that extreme, prejudiced attitudes in 
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either direction are either based on or lead to rationahzations 
Form E is based on the supposition that persons who hold 
extreme opinions tend to belittle the arguments and evidence in 
favor of the opposite point of view and try to stress or emphasize 
the arguments and evidence in favor of their own stand Work- 
ing on this assumption, Watson raises an issue and then gives 
six arguments, three of which are commonly held to be in 
favor of the issue and thiee against it Certain of these aiguments 
are strong or good arguments, and others are weak or poor argu- 
ments The directions lequire the subject to indicate w^hich are 
strong arguments and which are weak arguments Now it is as- 
sumed that if a person is unbiased, he will discriminate between 
the strong and weak arguments and mark them correctly, but if 
he IS biased, all arguments on the side he favors will appear to 
him strong and all aiguments on the other side will appear to 
him weak. The degice to which he favors one side or the other, 
and which side this is, is disclosed by the extent to which he errs 
in judging the aiguments Watson decided, because he did not 
succeed in getting arguments which competent people could agree 
were strong or weak, to mark an item biased only when there was 
unanimity in rating as strong all arguments in favor of one side 
of an issue and as weak all arguments on the opposite side. 


Example: 


Is Socialism desirable m the United 
States to-day.^ 

I. Strong 

Weak 

It would give to all the people control 
of the natural resources now in the hands 
of a few 

2. Strong 

Weak 

It would give over a great deal of con- 
trol to men who are not refined or cul- 
tured, sometimes not respectable, and 
hence would be undesirable 

3. Strong 

Weak 

Government enteiprise has not proved 
as efficient in many ways as has private 
business. 

4. Strong 

Weak 

Socialism is desirable because it would 
take away money from those who have a 
great deal and would divide it up among 
the rest of the people. 

5. Strong 

Weak 

Socialists are undesirable radicals and 
extremists. 

6. Strong 

Weak 

The old parties have become so corrupt 
that the country should turn to Socialism. 
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In this example credit was given as being an economic radical 
if numbers i, 4, and 6 weie rated stiong 


T VBLE 34 

Ratings of Arguments 
(from Watson) 


Argument number 

Side oj issue 

Test rating 

Per cent of agree- 
ment of judges 

I 

Yes 

Strong 

88% 

2 

No 

Weak 

83 

3 

No 

Strong 

100 

4 

Yes 

Weak 

92 

S 

No 

Weak 

92 

6 

Yes 

Weak 

95 

In Form C, 

“Inference Test,” 

certain facts 

pertaining to a 


situation are given. Following this brief description are certain 
conclusions or generalizations which one might draw from the 
facts Those taking the test are instructed to check the conclu- 
sions which fairly follow from the facts as given, being careful 
not to assume anything else than the evidence given in the state- 
ment They are also told, “You are not to consider whether the 
conclusions arc light or true in themselves, but only whether 
they are rightly inferred from the facts given in the statement. 
You may check as many as you believe to be perfectly sure and 
ceitain Do not check any merely probable inferences” Since 
most of the situations are described veiy briefly, no very im- 
poitant 01 extreme conclusions can logically be drawn. Accord- 
ingly any one who maiks a conclusion extreme in either direction 
may be said to be biased or prejudiced 

Example. 

Statistics show that, in the UnTed States, of 100 men starting 
out at an age of 35, at the end of 40 years one will be wealthy 
and 54 will be dependent upon relatives or charity for support 
I □ The present social order cheats the many for the benefit 
of the few. 

2. □ The average young man, under present conditions, cannot 
count on being wealthy at the age of 65 

3 □ Most men are shiftless, lazy, or extravagant, otherwise 

they would not need to be dependent 

4 □ The one man is living upon luxuries ground out of the 

bones of the mass of common people 

5 □ Some day the workers will rise in revolt. 

6 □ None of these conclusions can fairly be drawn. 
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In this example a person is given credit as tending toward an 
economic ladical il he maiks i, 4, 01 5 

Form D, “Moral Judgment Test,” is a clevei test based on 
the supposition that persons tend to be moie biased about con- 
temporary, immediate, peisonal affairs than about historical, dis- 
tant, impersonal affaiis The test consists of pairs of described 
situations, one about a historical or distant event and another 
about a contemporary or immediate event Statements are then 
given which approve or disapprove or in some way pass judg- 
ment on the story, one of which is to be maiked if agieed withf 

Example 

VI Duiing the latter part of the Great War and the years 
immediately following, officials of the United States government, 
suspecting certain organizations of being disloyal, broke into the 
headquarteis of radical groups without legal warrant. Searching 
the premises, they found publications of questionable chaiacter 
and confiscated them, and collected evidence enough to convict 
some of the ring-leaders 

1. □ Such action was quite right, where radicals weie suspected 

of opposing the Constitution 

2. □ The matter is indifferent, not to be called either right or 

wrong 

3 □ Such action on the part of the government officials was 
wrong, or at least unwise 

XII A group of detectives weie lined to investigate the ac- 
tivities of a gieat business coipoiation suspected of using some 
underhanded methods Without any legal warrant, some detec- 
tives and some dissatisfied workmen of the company broke into 
the office, found the books, and piovcd that the coiporation had 
been dishonest 

I. □ Since they got the evidence, what they did was all right. 
2 □ The matter is indifferent 

3. □ To get evidence in such a manner was wrong, or at least 

not very wise 

If XII-i was checked, together with either VI-2 or VI-3, the 
person taking the test showed a tendency to be biased in the 
direction of economic radicalism. The same tendency was shown 
if XII-2 and VI-3 were marked. 

These tests were constructed on the hypothesis that one’s ex- 
pressed attitude is based on or leads to rattonalhation. In the 
Argument Test this was supposed to be shown by the tendency 
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to overlook the logic m an argument, m the Inference Test, by 
the tendency to imagine things in a meagerly described siLudLioii 
that do not necessarily exist m the situation, and in the Moral 
Judgment Test, by the lack of consistency when the situation is 
impersonal and when it glows waim to the person’s prejudice 
The most doubtful points are whether a person’s expressed atti- 
tude IS necessarily the result of rationalization, and whether ra- 
tionalization IS the inevitable concomitant of an extreme position. 
It IS conceivable that a person might hold an extreme opinion or 
make an extreme choice w'lthout necessarily sacrificing the recti-, 
tude of his logical processes On the other hand it is an uncertain 
assumption that attitudes are based on or even very closely re- 
lated to reasoning processes Perhaps the test does not assume 
this, for It may be that the possessor of an unreasoned attitude 
resorts to rationalization only when hard pressed with the out- 
come of his reasoning processes in the test 

It may be that these tests are good tests of attitudes because 
in them a person shows the side on which he stands regardless of 
W'hat his reasoning processes may be Aie directly expressed at- 
titudes more valid than attitudes expressed under a subterfuge'’ 
If It IS true that conduct is best measured by tests w’lth a purpose 
so disguised that the person shows unconsciously his hand while 
he thinks he is doing something else, then these can be said to 
be attitude tests But better results may come from a direct at- 
tack The matter needs extended experimental study. 

In later work with leligious education tests, Watson shows his 
ingenuity in using new^-type objective test methods for measuring 
attitude. In general the method is to get the pupil to answer 
questions so as to express choices, preferences, or opinions Ex- 
amples of the various types of tests in addition to those already 
described are (28, p. 17) : 

a A brief description of an ethical problem situation with al- 
ternative solutions, one of which is to be marked best. 

Example: If you make a mistake and put a nickel for a penny 
in the slot machine, 

a. Put in four slugs to even it up. 

b. Call up the company and tell them about it. 

c. Smash the thing and get your nickel. 

d. Report it to the police. 

e. Do nothing. 
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b. A biief descilption of a pioblem situation with alternative 
solutions to be 1 anted in oidei of desirability 

Example A boy’s paients ask him to help at home He may 
Help when he feels like it 
Choose certain jobs and see that these are always 
done. 

Promise to help but give younger brother candy 
to do the work instead. 

c A brief description of a problem situation with alternative 
solutions, one to be rated as the best solution, another as the 
WORST solution. 

Example A bunch of boys are going to a dance in a town 
some distance away on a school night One boy’s 
paients think he should stay home and study. 
He may 

Say “Oh, I’m going anyhow,” and go. 

Sneak out and go 

Think It out, fairly, for himself 

Say, “Oh, all right,” stay home and sulk. 

In his “Orient and Occident” Watson (29) has a questionnaire 
to study attitudes toward Oiiental problems He divides this ques- 
tionnaire into two sections, one to determine “How You Feel,” 
the other to determine “What You Think.” An example of the 
first IS. 

Directions. Read each word listed in capital letters in the 
column below and think quickly how you feel about it Notice 
your own immediate reaction to it before you read further. 
Then read the woids or phiases suggested about it, noticing which 
comes nearest to agreeing with youi own reaction Write the 
number of that woid or phiase m the parenthesis in the right 
hand margin If none seems just right, choose the one which comes 
nearest to expressing your feeling If several appeal to you, choose 
the one truest to your first quick response. Do not try to reason 
out the logically best one. 

I. Japanese (i) Alert and progressive; (2) Untrustworthy; (3) 
Courteous, (4) Ingenious; (5) Conceited; (6) Politically 
ambitious. ( ) 

An example of the second is: 

Directions; Please indicate your opinion about each of the 
statements below by drawing a arcle aiound the letter or letters 
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in the margin which expresses your judgment. This is what the 
letters mean. 

T = True (absolutely) 

FT = Probably oi Paidy True 
D = In Doubt, Divided, Open Question 
PF = Probably or Partly False 
F = False (absolutely) 

If you do not know enough about any item to express an opinion 
about It, Cl OSS It out. 

T PT D PF F I Japan’s growing population problem can- 
not be solved unless white peoples allow free 
Japanese immigration into then countries. 

The distinction between thinking and feeling seems a little diffi- 
cult to draw in this connection It may be tiue that it requires 
somewhat more reasoning ability to extract the meaning from the 
statement than to catch the significance of the isolated phrases. 
But It would seem as though the real process of making a choice 
was about the same in the tuo instances — one based neither on 
thinking, nor on feeling, but on habitual responses of acceptance 
or aversion to one or anothei element in the situation piesented 

Hoinell Hart has done pionceiing work in the use of the ques- 
tionnaire in testing social attitudes and interests His method 
(8, p 9) consists essentially in presenting to groups of individuals, 
who are known to differ widely in recognized directions with 
respect to their socialization, stimuli so selected that it might be 
possible, on the basis of the observed differences in the reactions 
of these contrasted individuals to the stimuli, to construct tests 
and devise methods of scoring which w'ould indicate the probable 
degrees of socialization of other individuals whose attitudes it is 
desired to determine.” 

The method of marking the test, novel at the time when Hart 
fiist used It, IS indicated by the following sample: 


Sample List X 
Stub your toe + © 

Tie your shoe + — 

Come to despair + © 

Turn a page -f — 

Fill your pen -j- — 

Receive $ 1,000 © — 
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+ - 

© - 

+ © 

+ - 
© - 
+ 0 

The instructions to the subject are to put a circle around the 
plus sign @ after each thing that he is sure he likes, and around 
the minus ¥Ign 0 after each thing he is sure he dislikes. Things 
about which he does not care one way or the other are to be 
skipped A line is to be drawn under each of the five things that 
he feels most strongly about and a double line is to be drawn 
under the one thing in the list about which he feels most strongly 
of all. 

This method of indicating responses does not work entirely 
satisfactorily. In using this procedure in school, one can never 
tell whether a pupil is omitting an item through carelessness or 
neglect or because he is actually neutral to that item Conse- 
quently, It IS better to require that cveiy item be answered and 
then give a neutral symbol uhich may be encircled to indicate 
neutrality Another possible defect is the limitation placed on 
the expiession of exticme feeling by requiimg that five items be 
undei lined once and one of these doubly underlined Greater 
freedom of expiession for individual differences m feeling may be 
obtained, if desired, by including symbols to be encircled for 
indicating dcgiee of feeling 

Four lists in Hart’s questionnaire deal with things which people 
like (or dislike) to be, to do, 01 to have happen Lists 5 and 6 
contain lists of things to read or to study Tests 7 and 8 contain 
reforms which the subject may favor or disfavor. 

This “test” of Social Attitude and Interests was given on the 
one hand to a group of thirty-three men, who were taken as 
“leaders of social progress,” and on the other hand to a miscel- 
laneous group of 154, including thieves, employed boys, grade 
and high-school boys, business men, college business men, suc- 
cessful business men, junior “medics,” and other college men. The 
responses of these two groups were tabulated separately. When 
they were compared, it was found that the two groups gave 
markedly different reactions on ninety-threQ> of the 149 stimulus 


Mail a letter 
Be loved 
Be insulted 
Brush your hair 
Have a bath , 
Be seasick 
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words. Of course there is the possibility that factors other than 
degree of socialization such as amount of schooling or native abil- 
ity may influence the results, but Hart considers this possibility 
and dismisses it Hart concludes that (8, p. 37) “the differences 
in reactions arc such as to indicate that the men in the highly 
socialized gioup either arc, or believe themselves to be, or think 
It desiiable to appear, markedly more interested in international, 
economic, criminal and social justice, far more interested in 
discovering truth and having it spicad abioad freely, more in- 
terested in intellectual and ethical aspects of leligion, but less 
interested in creeds and foims, fai less interested in convention- 
ality and social approval, decidedly less sentimental and domi- 
nated by sympathy and immediate personal bonds, and much 
more indifferent to light reading, to certain aspects of personal 
comfort, to business success, and in general to trivial and selfish 
interests than the other men tested are, or believe themselves to 
be, or think it desiiable to appear'’ 

Shuttleworth (18) has used a similar technique to measure the 
charactei and enviionmental factors other than intelligence in- 
volved in scholastic success m college He used an instrument 
which he calls an “assayer,” which consists of a questionnaire 
of attitude and interests, a self-rating scale and a questionnaire 
about home and school background The self-rating section has 
been omitted in a more recent revision called “Student Informa- 
tion Blank” The results of this study of Shuttles orth’s are not 
clear inasmuch as he did not use a criterion which sharply dif- 
ferentiated between scholastic achievement and intelligence His 
correlations would seem to show that he was not successful in 
extracting items which could be used as measures of achievement 
apart from ability. 

The present writer (21) used a modification of the Hart tech- 
nique to measure students’ interests and attitudes. In its revised 
form the questionnaire contained the following lists of items: 

List I. Things one would like to own. 

2 Occupations. 

3. Activities. 

4. Places to go. 

5 . Magazines. 

6. School activities. 

7. Things to be judged right or wrong. 
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Before each item the three symbols + • — were placed, one 
of which was to be encircled to indicate liking, ncutiality, 01 
disliking Pupils weic instructed to omrt no items The 100 most 
significant items were selected, using as a cri tenon the studious- 
ness index (see p 333) or teacheis’ latings on school industry. 
Papers of 296 high school seniors in three schools were scored, 
using these 100 items If an item was answered -p or — in the 
direction of studiousness it was given a credit of 2, if the item 
was answered neutially, it was given a credit of i, and if it was 
answeied in the opposite diiection to that of studiousness it was 
given a credit of zeio 

The self-r (fifty items against fifty items) was .732 (N = 80) 
for one school, and 695 (N = 213) in another school Using the 
Spearman-Brown formula, the reliability of the whole question- 
naire of 100 Items is found to be 846 and 820 respectively 

The correlations obtained m one school were as follows* 

Table 35 

Intercorrelations of School Marks, Ratincs for Stubiousness, Studious- 
ness Index, and Studiousness Questionnaire 

(from Symonds, 21) 

Studious- 

Marks Ratings ness Index 

Terman Group Test of 
Mental Ability 339 017 

Aterage of school marks 556 

Teachers’ rating of stu- 
diousness . ... 475 

Studiousness index ..... 

In another school the correlations found were: 

^ teacher rating on studiousness — studiousness questionnaire -373 

f average mark — studioiisness questionnaire .252 

Assuming the use of a technique of questioning similar to that 
devised by Hart, and his method of evaluating the responses by 
the use of contrasted groups, this type of questionnaire has great 
possibilities. 

“There is no reason why questionnaires cannot be assembled 
to diagnose every phase of man’s activities and inteiests, using 
a technique similar to that outlined above. With empirical meth- 
ods of selecting and rejecting items, it should be possible to find 


Studious- 
ness Ques- 
tionnaire 

— 337 

IZI 

232 

■397 
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Items which correlate significantly with every phase of man’s life. 
In the measurement of conduct, invesLigaLois have been blocked 
because conduct does not leave behind a permanent record which 
may be studied and measured This can be acomplishcd by great 
effoit and expense by setting up special situations in which an 
objective recoid may be obtained as has been done by the Char 
acter Education Inquiij* There is always danger in these methods 
that a narrow or aitificial sampling of the precise conduct to be 
studied may be tested or that extraneous variables, as for ex- 
ample motives, may vitiate the results It should, however, be 
possible to find by trial and erior methods verbal responses which 
will corielate with almost any conduct trend.” (21, p 167 ) 

Another distinct technique for measuring attitudes, and one 
which is still more analytic than any previously described, was 
developed by Allport and Hartman (i). This technique consists 
in getting a wide variety of opinions on some issue and then 
scaling these opinions from one extreme position to the other. 
This graduated scale of opinions may then be read over by any 
one whose attitude is being measured, and he may indicate which 
statement best expresses his own opinion The sample of the 
scale chosen then becomes a measure of attitude. 

In their original tvoik Allport and Hartman chose seven con- 
crete issues of current interest (i, p 736), “dealing, respectively, 
with the League of Nations, the qualifications of President 
Coohdge, the distribution of wealth, the legislative control of 
the Supreme Court, prohibition, the Ku Klux Klan, and graft 
m politics. Sixty students, uppeiclassmen, were asked to write 
their personal views on the various phases of these questions. 
The resulting opinions on each issue weic then carefully sifted 
and the distinct and lelevant views weic assembled Keeping the 
issues separate, these views were piesented on slips of paper and 
arranged independently by six judges, teachers of political sci- 
ence and psychologists, in order of their logical position in a scale 
ranging from one extreme on an issue in question to the opposite 
extreme.” Later copies of the scales were distributed to students 
in another class, and each student was instructed to read the 
scale carefully, and to check the one statement in each of the 
seven issues which most nearly coincided with his or her own 
view They were also given opportunity to designate the certainty 
of their opinion and the intensity of interest or feeling upon the 
question. 
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Thurstone (23, 24, 25), sensing the importance of Allport’s 
work, has embarked on a program of developing Allport’s basic 
idea and refining the statistical methods employed, which is 
aimed to eventuate in a senes of scientifically determined and 
defensible scales 

One of Thuistone’s scales is reproduced here: 

Scale of Attitude towaid the Movies* 

(The scale value of each statement is shown in parentheses 
following Its seiial number The higher the scale value the more 
favorable the statement towaid the movies. The statements have 
been arranged in random older) 

I (15) The movies occupy time that should be spent in more 
wholesome recreation 

• 2. (13) I am tired of the movies, I have seen too many poor 
ones. 

3. (45) The movies are the best civilizing device ever devel- 
oped. 

4 (0 2) Movies are the most important cause of crime. 

5. (2 7) Movies are all right, but a few of them give the rest 

a bad name. 

6. (2 6) I like to see movies once in a while, but they do dis- 

appoint you sometimes 

7 (2 9) I think the movies are fairly interesting. 

8. (2 7) Movies aie just a harmless pastime 
9 (17) The movies to me are just a way to kill time 
10. (4 o) The influence of the movies is decidedly for good. 

(39) The movies are good, clean entertainment. 

12 (3 9) Movies increase one’s appreciation of beauty. 

13 (1.7) I’d never miss the movies if we didn’t have them. 

14. (2 4) Sometimes I feel that the movies are desirable, and 

sometimes I doubt it. 

15 (00) It is a sin to go to the movies 

16 (43) There would be very little progress without the movies. 

17. (43) The movies are the most vital form of art to-day. 

18. (3 6) A movie is the best entertainment that can be obtained 

cheaply. 

19. (3 4) A movie once in a while is a good thing for everybody 

20. (3.4) The movies are one of the few things I can enjoy by 

myself. 

21. (13) Going to the movies is a foolish way to spend your 

money. 

• From Thurstone, L L , “A Scale for Measuring Attitude Toward the Movies.” 
Journal of Educational Research, 2Z 93, 94 (Sept, 1^0). 
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22 (11) Moving pictures bore me. 

23. (06) As they now exist movies are wholly bad for children 
24 (06) Such a pernicious influence as the movies is bound to 
weaken the moial fiber of those who attend. 

25. (03) As a piotest against movies wc should pledge our- 

selves never to attend them 

26. (o.i) The movies are the most important single influence 

for evil. 

27. (4 7) The movies are the most powerful influence for good 

in American life. 

28. (23) I would go to the movies more often if I were sure 

of finding something good 

29. (41) If I had my choice of anything I wanted to do, I 

would go to the movies. 

30. (22) The pleasure people get from the movies just about 

balances the harm they do 

31. (20) I don’t find much that is educational in the current 

films. 

32 (19) The information that you obtain from the movies is of 

little value. 

33 (ro) Movies are a bad habit. 

34 (33)1 like the movies as they are because I go to be enter- 

tained, not educated. 

35 (3 i) On the whole the movies are pretty decent. 

36 (0 8) The movies are undermining respect for authority 

37 (2 7) I like to see othei people enjoy the movies whether 

I enjoy them myself or not. 

38 (03) The movies are to 'olame for the prevalence of sex 

offenses. 

39 (4 4) The movies is one of the great educational institutions 

for common people 

40. (0 8) Young people are learning to smoke, drink, and pet 
from the movies 

In using the scale for placing a person’s attitude, Thurstone 
prefers not meiely to ask a person which sample on the scale 
most nearly coincides with his point of view, but would also 
have him indicate the range of opinions which he would be 
willing to endorse The mean position of this range might not 
coincide with the step on the scale which he would state as best 
representing his position, but the mean might be a fairer measure 
of his opinion. 

In selecting statements for inclusion in the scale Thurstone 
( 24 ) 23) recognizes the following criteria (a) The statements 
should be as biief as possible so as not to fatigue the reader whe 
is asked to read thcc whole list (b) The statements should be 
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such that they can be endorsed or rejected m accordance with 
their agieeineiil 01 dibagieemeut witli the attitude of the reader, 
(c) Every statement should be such that acceptance or rejection 
of the statement does indicate something legarding the reader’s 
attitude about the issue in question, (d) Double-barreled state- 
ments should be avoided (e) Irrelevant statements should be 
avoided 

Thurstone gives three criteria by which items of the scale may 
be judged for validity (a) The scale must transcend the group 
measured In othci words it should be a tiue scale for both paci- 
fists and militaiists, or for wets and drys (b) The items must 
be unambiguous This means that the ogive curve rises steeply 
from 0 per cent agreeing to too per cent agreeing as one pro- 
ceeds up the scale of attitudes In the diagiam, item A is less 



ambiguous than item B (c) The items must be relevant The 
criterion of relevance is to see the extent to which those in a 
group who endorse the item tend to endoise items higher up 
the scale and fail to endorse items lower down the scale. 

Thurstone has performed some valuable analytical work in 
the method of scale construction from judgments preparatory to 
the actual work of making a set of attitudes scales. 

Reliability 

The reliability of the various attitude measures used has been 
meagerly determined. The extension of methods has outstripped 
their critical study. 

Watson (30) has studied the reliability of his test both for 
total prejudice score and for the separate scores for the eighteen 
lines of bias Results of 92, .89, and 96 jvere obtained for the 
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self-correlation in three groups, half of the test against half The 
reliability of the \^holc test may be placed at 96 The following 
table shows the reliability found m separate biases. 

Table 36 

Reuability of Attitude Questionnaires 


Direction of 
Bias 

1 Economic radicals 

2 Economic liberals 

3 Economic capitalists 

4 Persons lighting for a 
“social gospel” 

5 Persons interested in a 
‘ personal gospel” 

6 Religious fundamen- 
talists 

7 Religious modernists 

8 Religious radicals 

9 Protestants 

10 Catholics 

11 Persons with high, 
strict moral standards 

t2 Persons with broad, 
loose moral standards 

13 Bias against economic 
capitalism 

14 Bias against economic 
radicals 

15 Bias against religious 
fundamentalists 

16 Bias against religious 
radicals 

17 Bias against the unsci- 
entific 

18. Bias against the re- 
ligious 


(from Watson) 


Self-T 

Self-T 

(30 papers) 

(dz papers) 

74 

69 

39 

SI 

68 

S9 

84 

69 

82 

84 

94 

•79 

81 

.60 

97 

80 

64 

3S 

SI 

•52 

78 

69 

70 

46 

75 

67 

S4 

49 

40 

58 

68 

84 

46 

S8 

8S 

78 


Maximum 

Probable 
self-T on 
200 such 

number 

points 

of points 

{70 papers) 

189 

70 

93 

69 

192 

60 

188 

70 

143 

88 

182 

81 

89 

77 

164 

83 

S8 

6S 

66 

77 

107 

81 

94 

6S 

188 

.68 

205 

49 

iSi 

6S 

iSi 

88 

86 

76 

182 

.80 


The present writer (20) found a reliability of .67 for the 115 
Items on his social attitudes questionnaire. Thirty-two of the 
most significant items yielded a correlation of .62, which would 
be raised to .83 for 120 equally significant items Zeleny (31) 
found a reliability of 893 and .894 for a sixty-eight item true- 
false opinion^ test. Hprpcr (6) found that a true-false test of 
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social attitudes yielded reliability coefficients for one half of the 
test with the other half of 782, .751, and 817 in various groups 
of teachers Jones (10) finds a con elation of 71 beti^een the 
twenty-five odd and twenty-five even items of an opinion test 
with college students in which an opportunity is given for ex- 
pressing an opinion as to five degrees of truth for each item. 


Validity 

Attitude questionnaires, as measures of opinion, are valid only 
to the extent that they agree with other indications of a person’s 
choice or tendency to act. However, an expressed opinion is im- 
portant on its own account, regardless of how' well it agrees with 
the actual overt choice that might be made, or the tendency to 
act In a democracy, voting, public speaking, and other forms 
of expression help determine the policies of the state There is 
every reason for taking the results of an attitude questionnaire 
at face value as an expression of opinion, particularly when no 
immediate issue is at stake But as measures of undeilying ten- 
dencies to act they should be checked against other indications 
of choice before being accepted as valid 

Allport’s work show s that diflierent issues may result in different 
types of opinion distributions. He obtained some normal, some 
skewed, and some bimodal distributions w'lthin the college group 
which he studied There was a distinct cleavage of opinion con- 
cerning the League of Nations, and an ovciwhclming sentiment 
for prohibition with the group tested Rice, on the other hand, 
believes that great bodies of social opinion as expicsscd in elec- 
tions form a noimal distribution A really satisfactory answ'er to 
this question must aw'ait technical woi k on the evenness of scaling. 
Even the possible significance of Allport’s bimodal distributions 
must not be overlooked, although certainly in attitude question- 
naires made up of a number of items designed to measure more 
general attitude tendencies, the distribution tends to approximate 
normality. 

Allport finds that there is a distinct relationship between the 
tendency to select expressions of opinion at either extreme end 
of a scale and certainty of opinion. “Certainty of the kind here 
exhibited is not an index of objective truth, but an accompani- 
ment of increasing distortion of truth through narrowed emphasis 
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upon one phase. The man thinks he is right because he feels 
stiongly ” Allpoit found that intensity of feeling and certainty 
tend to go together, and also that there are more men than 
women at the c\ti ernes at both ends of the scale. 

Allport studied his group by means of self-ratings and various 
types of questionnaires He concluded that the radicals and re- 
actionnaiies, that is, those taking extreme positions, are moie 
like each other than the conservatives or middle-of-the-roaders 
in self-ratings of emotionality, rapidity of mot ement, self-reliance, 
ovcrestimat on of mental ability, and lack of agreement with the 
conventional moral code In the attitude questionnaiies it was 
found that radicals and reactionaires share one another’s attitudes 
on diverse questions moie fully than the conservative shares the 
attitudes of either. Again, without complete evidence. Allport 
surmises that the radical resembles the exti overt and the reac- 
tionary resembles the introvert, a conclusion which remains to 
be demonstrated. 

Watson (30) considered the validity of his tests on a number 
of grounds. He found the correlation of each sub-test with the 
total as follows: 

Table 37 

Correlations between Gross Scores for Each Test Form and Gross 
Scores for Total Test 

(from Watson) 


Test form 

Correlation 

A 

68 

B 

94 

C 

79 

D 

58 

E 

56 

F 

•53 


Watson says, 

“It seems fair, therefore, to conclude that each form of this 
test is as closely related to the purpose of the test as a whole, as 
are the various items in an intelligence test to the score of the 
test as a whole; and that the forms are more closely related in 
these tests of fair-mindedness than is commonly the case in test- 
ing various forms of arithmetical ability, reading ability, writing 
ability and the like.” 

Watson uses other means of validating his test He compared 
the score of certain individuals selected by their group as being 
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fair-minded with the score of others in the gioiips, especially 
with other groups with the reputation of being decidedly picju- 
diced, and found that the selected individuals actually did rate 
high in fair-mindedness on the test Case studies tended to icveal 
the accuracy of the diagnosis afforded by a study of the test 
results Comparison of the scores of groups which by reputation 
are fair-minded or prejudiced indicated that the tests substan- 
tiated popular impression 

The present writer (20) found a correlation betw^een his social 
attitude questionnaire and social infoimation of 28, and intelli- 
gence of 28 Jones (10) reports a correlation of less than 20 
between his measure of radicalism and intelligence Watson finds 
that his test correlates — 005 with the Thorndike-McCall Read- 
ing Test and — 08 W'lth IQ Apparently these measures of atti- 
tude have little relationship to ability 

Symonds found a definite sex difference, the boys tested tend- 
ing to be more liberal Harper found that “male graduate edu- 
cators were found to occupy a position on the scale slightly less 
conservative than the position of female graduate educators.” 
Jones reports little sex difference. 

Jones found that the college course had slight effect on the 
opinion of college students Seniois weie more conservative than 
freshmen in economic affairs, but more liberal in religious mat- 
ters. Symonds found no change in social attitude from the 
eighth grade through the college senior year On the other hand 
Harper reports a correlation of 521 between extent of education 
above the eighth grade and scoics on his test of radicalism. He 
found that a group w'hich pursued a graduate course in educa- 
tion which stimulated freedom of discussion in the atmosphere of 
liberalism raised its scoie 144 points on his test of seventy-one 
items during the year Watson also reports that in a Y M. C A 
conference and in certain classes on race problems there was 
a distinct increase in scores on fan-mindedness This would point 
to the fact that training in attitudes is very specific, that ordi- 
nary school instruction does not have much influence on social 
attitudes, but that carefully planned activities and discussions 
are capable of modifying attitudes and prejudices in the par- 
ticular field worked in 

Harper reports no relationship between attitude and religious 
affiliation or the political party espoused. 
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Walson (29) m his study of attitudes toward the Orient finds 
that the results on his quesliuniiaiie show negligible relationship 
iMth gcogiaphical sections of the country and amount of educa- 
tion There is evidence that racial attitudes are related to eco- 
nomic level, religious denomination, the reading of liberal peri- 
odicals, and having Oiiental friends. He also finds a marked 
relationship between amount of leading and a liberal attitude and 
between mfoimation and attitude. These relationships interlock, 
and until further woik is done it is impossible to tell which are 
fundamental and which are merely subsidiary. 

Do these questionnaires measure attitude^ Can attitude be 
measured' Bam* insists that w'C have no surety of a high re- 
lationship betw'een verbal and adjustment behavior, especially 
when tabooed or ill-organized behavior is the subject of inves- 
tigatioK. Thurstone’s (23) interesting discussion is pertinent here 

There comes to mind the uncertainty of using an opinion as 
an index of attitude The man may be a liar If he is not inten- 
tionally misrepresenting his real attitude on a disputed question, 
he may neveitheless modify the e.xpression of it for reasons of 
courtesy, especially in those situations in which frank expression 
of attitude may not be well received This has led to the sug- 
gestion that a man’s action is a safer index of his attitude than 
W'hat he says But his actions may also be distortions of his 
attitude A politician extends fiiendship and hospitality in overt 
action while hiding an attitude that he expresses more truth- 
fully to an intimate fiiend Neither his opinions nor his overt 
acts constitute m any sense an infallible guide to the subjective 
inclinations and preferences that constitute his attitude There- 
fore we must remain content to use opinions, or other forms of 
action, merely as indices of attitude It must be recognized that 
there is a discrepancy, some error of measurement as it weie, 
between the opinion or oveit action that we use as an index 
and the attitude that we infer from such an index.” ** 

But some outer expression must be taken as the sign or sym- 
bol of the inner indication or choice. The objective data show 
evidence that a person’s verbal expression of his opinion may 
be taken as a fair index of his attitude It is usually safer in 
measuring conduct to disguise what one is doing. If you tell a 

Bam, R , “Theory and Measurement of Attitudes and Opinions,” Psycho- 
Ingiial liullehn, 27 357-379 (1930) 

lluiistonc, L L, “Attitudes Can Be Measured” Reprinted by permission 
from the American Journal Sociology, Volume XXXIII, No 4 
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person directly that voii >iic mcasunnsj his picjustice he will at 
once be on Ins ^naul Sn tho-.c attitudes IC'ts i>i i]iit''[ioniiaiics 
oufrht to be most successful whuU make a iseison di\ nice hi' 
piefcience or choice tthen he is ippaieniU iiuent on siunc oihei 
actiMty Honetci, tlie diiect que--tioii oi ballot also seems to 
)-icld satisfactory measuics of attitusle when tlicie is no immediate 
and personal issue at stake 
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Chapter VII 

INTEREST QUESTIONNAIRES 

I N using interest questionnaires for guidance, two questions 
of fundamental importance must be considered One concerns 
the permanency of inteiest, the other, the relation of interest 
and ability Unless interests have some degiee of peimanencc, 
the determination of interests uill have little piognostic signifi- 
cance. If interests are unstable, fickle, subject to every passing 
whim, instead of repiescnting deep and undeilying trends, they 
have little piophetic value, they are of only theoretical impor- 
tance, and they have little bearing on the welfare of any one. 

An investigation of how interest and ability aie i elated is neces- 
sary m order to help determine the relative importance of each 
in guidance If interest and ability are closely con elated, one 
may be used in place of the othei, and a single deteimination of 
ability is all that is necessaiy for gning adequate advice, if, 
however, interest and ability are not closely i elated, separate 
determinations of each are needed so that one may supplement 
the other. 

Permanence of Interest 

Studies of the permanence of mteiest have been made by 
Thoindike (7, 8), Willet (9, to), Ciathoine (t), Fianklin (2, 3), 
and Fryei (4). Thorndike (8) had college jumnis laiik in interest 
the following seven activiiies as they lemcmbeied them in ele- 
mcntaiy school mathematics, histoiy, liteiatuic, science, music, 
di awing and other haudwoik Similar rankings wcie obtained for 
what the subjects remembered their order of inteiest to be duiing 
the high school period and also for their present order of inteiest 
in college The correlation of order of interest m elementary 
school and in college was on the average over .60. Thoindike 
realizes that recollection of previous interests is not altogether 
trustworthy, but offers these considerations m its defense (8, p. 
453): “I do not believe that such tendencies to read present 

239 
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interests into the past and to leave the order reported for one 
period unchanged so iar as possible, are very siiong, there being 
a contrary tendency to remember and look for differences On the 
whole, I should expect the effect of the large chance errors in 
lowering the estimate of peimanence to neaily or quite counteract 
whatever balance of prejudice there may be in favor of similarity 
of interests or projection of present conditions Into the past ” 
Thorndike concludes (8, p 456) “These facts unanimously wit- 
ness to the importance of eaily interests They are shown to be 
far fiom futile and evanescent It would indeed be hard to find 
any featme of a human being which was a much more permanent 
fact of his natiiie than his relative degrees of interest in different 
lines of thought and action ” 

Willet (9, 10) obtained answers on three successive occasions, 
at one-year intervals, to the same questions on choice of school 
subjects and futuie occupations from 488 junior high school pupils. 
He concludes “It appears that permanence of interests both in 
school subjects and in future vocations is decidedly lacking for 
the majority of these 488 pupils ” A technique of asking pupils 
their present inteiests and repeating the questions after an inter- 
val is probably safer than the technique used by Thorndike, in 
tvhich memory is depended on There is, however, a possibility 
that Willet used a classification of occupations w'hich was so fine 
that a pupil might not be credited W'lth icpeating the same answer 
merely because he changed the name of the occupation without 
really indicating a change in the fundamental character of his 
interests Willet believes this factor is not important, for whereas 
he found that 121 changes were from first to second choice, ninety- 
nine pupils interchanged first or second best-liked subject with 
the most-disliked subject He also believes fiom a study of the 
reasons given for disliking subjects that pupils in stating interest 
in a subject are influenced by their libng or disliking of the 
teacher. 

Crathorne (i), using a technique similar to Thorndike’s, found 
that of those pupils who professed definite leanings toward some 
particular occupation on entering high school, almost exactly one 
half claimed a radical change of interests before entering college. 

Franklin (2, 3) has made the most thorough and extensive 
study of the permanence of vocational interests by repeatedly ob- 
taining answers to the same questions regarding present interests 
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from the same children He reports, after carrying his work 
through three years, that (3, p 440), “the vocational mteiests 
of pupils show a high degice of permanence during the junior 
high school peiiod After three years, thice childien out of four 
still adhere to their original type of vocational preference and 
three out of five still cling to the actual choice which they orig- 
inally expressed . . . The high degree of constancy of the per- 
centage of permanency over so long a period of time indicates 
that the interests expressed by entering j’unior high school pupils 
are significant and worthy of consideration ” 

Fryer, in summing up his own woik, in which he used the tech- 
nique of Thorndike, gives the following table which represents his 
own judgment of the matter. 

Table 38 

Probable Permanency of Vocational Ambition between Various 
Developmental Periods 

(from Fryer, 4, p 477) 

Expressed tn 
percentages 

Grammar-school and high school ... 75 

High school and college . 75 

Grammar-school and college 50 

Grammar-school, high school, and college 40 

Fryer’s conclusion after surveying his findings is (4, p. 478): 
“Vocational interests cannot be used as exact guides to future vo- 
cational interests, they are only slightly suggestive of such future 
interests ” 

The upshot of this conflicting evidence is that statements by 
young people of pi ease preferences of either subjects of study or 
occupations are of little value inasmuch as they are unstabie and 
subject to change On the other hand, statements in terms of 
broad categories, whether of subjects of study or occupational 
groups, represent rather deep and underlying trends and can be 
depended on to possess considerable predictive significance In 
infancy interests are capable of taking almost any turn, being 
dependent on only the most general factors such as keenness of 
the senses, intelligence, muscular power, etc On the other hand, 
as we shall see, in adulthood interests have become rather fiimly 
established so that only infrequently and with great difficulty does 
a man make a radical change m his occupadon or even avocation. 


Expressed in 
correlation 
coefficients 

+4 
+ 5 
+ i 
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Between these two extreme periods the interests are being molded 
At first the broad trends are laid down, and mteiest becomes 
inclined toward books, people, mechanical objects, animals, plants, 
sport, etc. Gradually even these become specialized, so that nor- 
mally the period of development is also the period for the special- 
ization of interests. 

At the college level these interests have on the whole become 
so set that interest questionnaires have great prognostic signifi- 
cance, as the woik of Mooie (37), Ream (40), Freyd (30), 
Stiong (49), and Cowdery (25, 26) indicates On the other hand 
in the junior high school, because differentiation of interests is not 
nearly so marked, much less dependence can be placed on definite 
statements of interest, notwithstanding that the broad underlying 
trends of interest are probably pretty well established at that 
time We cannot deny that cleverly constructed questionnaires of 
a number of items designed to differentiate between these underly- 
ing interest trends may be of much value and significance for 
guidance even in the junior high school peiiod 

Interests and Abilities 

The relation between interests and abilities has been studied by 
Thorndike (23, 24), Hartman and Dashiell (17), Bridges and 
Dollmger (il), and Fryer (13, 14) Thorndike (23), as an addi- 
tion to the class experiments described in the previous section on 
permanency of interests, also obtained data at a later meeting of 
the same classes showing rankings in ability with respect to the 
seven categories which he used He states that reports of relative 
interest at ages eleven to fouiteen have a median correlation of 
66 with relative ability at twenty-one or later He concludes, “A 
person’s relative interests are an extraordinarily accurate symp- 
tom of his relative capacities ” 

Bridges and Dollmger (ii) criticized Thorndike’s experiment 
on the grounds that a pei son’s ranking of his ability would in- 
fluence or conversely be influenced by his ranking of interest 
Using success m college courses as a measure of ability, these 
investigators find that the correlation of ability and interest ranges 
from 22 to 28 They state, “The conclusion of practical impor- 
tance for vocational guidance is that both must somehow be evalu- 
ated separately ” 

Thorndike (22), commenting on the work of Bridges and Dol- 
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linger, gives as his belief that the method which these writers 
used underrated the true relationship Furthermoie, he points out 
that the range of interests embodied in the set of school subjects 
which any one student cairies is probably much less than the lange 
of interest in the seven subjects which he (Thorndike) used in 
his previous experiments. Thorndike maintains after reviewing 
ihe evidence that a fair figure for the true relationship of interest 
and ability is 70 

Hartman and Dashiell (17), in a laboratory experiment with 
paper and pencil tests probably coveiing a narrow range of in- 
terest, found a negligible relationship between success on the tests 
and interest in them. 

Fryer (16) has studied the relationship between the intelligence 
of a pupil and the intelligence level of the occupation which he 
states to be his choice The relationship is not at all close. The 
following table presents Fiyer’s conclusion based on his findings. 

Table 39 

Probable Relation of Intelligence and Intelligence Requirement of 
Vocational Ambition at Various Developmental Periods 

(from Fryer, 4, p 489) 



Per cent 

Correlation 


likely to 

coefficient 


approximate 

expressing 

Developmental periods 

correct choice 

relationship 

In grammar-school (later years) . . . 

30 

-ho I 

In high school (other than at graduation) 

40 

-ho 2 

In college (other than at graduation) 

35 

-ho l 

In vocational school (requiring high school gradua- 
tion 

75 

-hos 

In vocations (seeking vocation guidance, ages 20 
to AO vears) .... . 

AO 

-ho. 4 . 


Fryer concludes, “It would appear that there is some relation 
between intelligence and ability to select a suitable vocation. . . . 
Vocational interests should be consideied as one of many factors 
to be used m vocational guidance.” 

In Genetic Studies of Genius, Volume I, Terman presents data 
showing how gifted children (with high IQ’s) differ in interests 
from average children In scholastic, occupational, play, and read- 
ing interests there are noticeable specific differences between bright 
and average children A coefficient of correlation of 41 expresses 
the average relationship between the order of preferences of 
studies as stated by the children and order of quality of work in 
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the subjects as rated by the teachers Since the correlations which 
go to make up this average were lound on control and gifted 
groups separately, they undoubtedly suffer fiom restricted range. 

Kelley (i8) in his Educational Guidance tried out tests of abil- 
ity and mteiest for predicting success in various subjects He 
gues the correlation of tests of ability and interest for predicting 
mathematics, English, and history as .35, .34, and 33 respec- 
tively 

Recent studies have discussed more in detail the relation of 
mteiest in school studies to academic achievement There is a 
marked tendency toward good achievement in subjects that stu- 
dents profess to like and toward low achievement in subjects they 
dislike Langlie concludes (21, p. 248), “the results indicate that 
there is a relationship between statements of interest and grades 
obtained in single couises, and the relationship is probably sig- 
nificant enough to be of value to an adviser or peisonnel man 
There is a tendency to obtain one’s best giades in those courses 
(in college) which were liked in secondary schools, and to get 
lowei grades m those courses which were disliked m secondary 
schools ” 

But recent attempts to use interests for the prediction of achieve- 
ment have been disappointing King (19) presents such results as 
the following: 

Table 40 

Correlation of Interest in School Subjects ans Achievement 
(from King) 




Achievement 

Achievement 

Achievement 



m English 

mmathehiac Jc 

in science 

Interest in English . . . 

• • • • 

143 

— 136 

— 193 

Interest in mathematics . 


— 105 

342 

— no 

Interest in science 

• . • 

— 029 

— 260 

187 

Composite interest . 

• 

16s 

361 

276 


In summary we must conclude that the relation between ability 
and interest is distinct but not close, and that the whole problem 
IS a somewhat confused one. Too much care cannot be taken to 
define what is meant by ability If ability is used synonymously 
with intelligence, it is evident from Fryer’s data and from com- 
mon observation that the range of interests for persons of the 
same intelligence may be very wide. If interest and ability in a 
single activity are compared, we find the relationship positive but 
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low. The relationship becomes close only when a comparison is 
made between ability and inteiest in rather fundamental 01 basic 
activities where there is a possibility for wide diffei cnees in both 
interest and ability. 

Our genetic scheme used in discussing the permanence of inter- 
ests applies here In infancy, when neither interest nor ability 
has developed, the relationship is near zero. In adulthood, when 
ability in various activities has reached its probable (though not 
possible) maximum and inteiests have become specialized, the 
relationship is fairly close Between these two periods the relation- 
ship IS increasing In childhood, when the broad interest trends 
are taking form and abilities are entering upon differentiation, the 
relationship is low. The young child who has no skill in swimming 
probably has no special interest in swimming As he increases his 
skill, his interest keeps pace The expert in any line whose ability 
IS recognized is almost sure to parallel his ability with interest 
The two increase together In the junior high school period, how- 
ever, the relationship is low enough so that one forms a useful 
supplement to the other for the purposes of guidance 

The conclusion is that properly and skilfully devised question- 
naires that differentiate between rather fundamental interest trends 
ought to prove valuable devices for the guidance of adolescents. 

Measurement of Interests 

A fruitful line of attack upon the measurement of interests was 
developed under the direction of C S Yoakum in the now dis- 
solved Bureau of Personnel Research at the Caincgie Institute 
of Technology One of the fiist studies in this group, by B V 
Moore, attempted to find factors that would differentiate the abili- 
ties and interests of salesmen, designers, and pioduction execu- 
tives who were graduates from the engineering school Moore was 
interested in finding out for each graduating engineer in what 
line of work he would function most efficiently and satisfactorily. 
Of all the tests he used, he found that a questionnaire of occupa- 
tional preferences showed the greatest differentiation His method 
was the empirical one of trying out the questionnaire on appren- 
tice engineers and determining the responses for each item Moore 
describes his scoring methq^ as follows (37, p. 45): 

“A stencil in the form of a cardboard with perforations or slots 
allowing only the engineering type of occupations to be visible is 
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placed over the list of occupations which have been checked The 
nuiiibci of plus maiks is counted and lecoidcd in the margin The 
number of minus maiks is also recoided Then this stencil is re- 
moved and another stencil is placed ovei the list, allowing only 
the sales type of occupation to be visible, and then the number 
of plus and minus marks is recoided The number of plus maiks 
before engineering occupations is added to the number of minus 
maiks befoie sales occupations in order to get the number of 
checks in favor of engineering occupations The number of minus 
maiks befoie engineering occupations is added to the number of 
ph's marks befoie sales t) pc of occupations to get the number 
of checks m fa\or of sales occupations Finally the number of 
maiks in favoi of sales occupations is divided by the total num- 
bei of check maiks to get the percentage of marks in favor of 
sales occupations ” 


Ream (40), in a study of life insurance salesmen, used a more 
extensive blank Ream’s contribution to the technique was his 
selection of items that showed only a difference which satisfied 
a statistical criterion of reliability Otherwise each item was 
weighted one in making his composite. 

Frcyd (30) used two questionnaiies devised in the Bureau of 
Personnel Research which profited fiom the experience of Moore 
and Ream One was a list of seventy-two occupations designed to 
sample all types of activities. Each occupation was followed by 
L i* and D, one of which symbols the subject answering the 
questionnaire was to encircle according to his wish to indicate 
liking, neutrality, 01 disliking of the occupation The second list 
contained 129 items which a person might like or dislike These 
included a series of physical attributes of people, a series of men- 
tal attiibutes of people, and a series of miscellaneous items. Each 
of these items was followed by five symbols as shown to permit 
the recording of five degrees of likmg-dishkmg 


Fat men L! L ? D D' 

Fat women L! L ? D D! 


Freyd, interested in finding items which would differentiate be- 
tween the “mechanically minded” and the “socially minded,” used 
the following method: 

I For each group tables were made showing the frequenej"^ 
with which each symbbl was encircled. 
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Item Group 

Response 

V L 9 D 

Dl 

Total 

Fat men Mechanical 

2 14 19 6 

2 

43 

Social 

2 3 i6 6 

3 

30 


2. Select those symbols which seem to show signihcant dilFer- 
ences m the proportions of the two groups encircling them. In 
the above item the responses 14 and 3 for the symbol L seem to 
be significantly different. 

3. Determine the differences in proportion. 




4. Determine the standard error of the difference in proportion 
by the formula. 


In the above example, 

-2 _ -33 X 67 

43 

=.0081 


»2 

.10 X -90 
30 


e—.o<) 

d 23 

e .09 


2.56 


5. List the cases in which the difference in proportion is at 
least twice as great as its standard error. 

6 Determine which items will enter into the score in a positive 
way and which in a negative way. One group must be chosen to 
represent the positive direction. Each item is given a value of 
+i or — I, that IS, all items are weighted equally. 

7 The total score for any person will be the algebraic sum of 
the positive and negative values attached to the significant items 
which he checks. 


Since there are many symbols which are not included in the 
scoring scheme, it will often mean that a person will escape 
marking the scorable symbol merely by chance. Freyd, confronted 
with this criticism, answered it as follows (30, p. 87) . (a) Symbols 
were not chosen for the scoring key unless a “fair” proportion of 
the number of persons answering the item checked that symbol, 
(b) Those who use some other symbol thkn that in the scoring 
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key really are given an intermediate score (0) between the two 
key scores (+i and — i) and are hence not really ignored (c) 
“We may assume that the factors which operate to cause one to 
go into an occupation aie very diverse, and that any two or 
three are sufficient to place him in that occupation Then as long 
as an individual falls into compartments showing significant dif- 
ferences in two 01 more tests or answers to questionnaires, he 
sho^^ s evidence of having been influenced by those two factors 
to place him in the occupation in which he finds himself ” These 
arguments are not wholly convincing, especially since Freyd ad- 
mits that one or two items can influence the score 
The Bureau of Personnel Reseaich Interest Analysis Blank has 
been taken up and rigorously studied by Cowdery and Strong 
at Stanfoid Univeisity. Their modified “Interest Report Blank” 
included eighty-four occupations, seventy-eight types of people, 
thirty-four sports and amusements, six kinds of pets, thirteen 
lepresentative kinds of reading, twenty-three miscellaneous ac- 
tivities, and twenty-five school subjects Adults require twenty to 
thiity minutes to fill out the blank 

Cowdery (25) first gave the questionnaire to three groups of 
‘caiefully selected” professional people, thirty-four doctors, 
thirty-seven engineers, and thirty-four lawyers. Scoring keys were 
worked out for each gioup separately In all previous work with 
this questionnaire each item has been weighted one. Through the 
influence of 1 L Kelley, Cowdery adopted the following plan 
of weighting scoics This weight, called b, is obtained from the 
formula 

h — —i— 

(i-4=)o 

where 4 is the coefficient of correlation from the formula 

ad — be 

^ >A(a -f- c) (6 -)- <^)(a -f- &) (c -(- «^) 

This latter formula demands a fourfold table and necessitates a 
dichotomization of the L ? D series of possible responses. We are 
not told how this dichotomization was accomplished, but prob- 
ably the frequency of replies to one symbol was used against the 
sum of replies to the other two. These b’s yielded weights for each 
Item in the Medical Interest Scale from 0 to 13, with a total pps- 
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sible score for all 263 items of 920 Cowdery found that when the 
members of all thiec piofessions were scored on the Medical 
Interest Scale theie was very little overlapping between doctors 
and members of the other professions Similar results '^^e^e found 
from scales designed to measure interest in the other professions 
Using the same set of questions, scoring keys weie developed 
which gave highly differentiated responses coriespondmg with 
remarkable closeness to the interests of different professional 
groups. 

Stiong (43), using the same Interest Analysis Blank, studied 
the responses fiom eighteen different occupational gioups He 
discovered a method of weighting items which was simpler than 
the Kelley-Cowdeiy method, yet yielded equally good results. 
Strong (43, p 201) found “that there exists a nearly straight-hne 
relationship between the b scores and the difference between the 
percentage of the profession in question and the percentages of 
all the professions This straight-line relationship holds very well 
if either, or both, of the percentages are between 8 per cent and 
92 per cent inclusive Within these limits differences of o to 2 
are weighted o, 3 to 7 are weighted i, 8 to ii, 2, 12 to 15, 3, 
16 to 20, 4, etc If either of the percentages is between o and 2 
or 98 and 100 inclusive, the differences are weighted as follows 
o, o, I, I, 2 to 3, 2, 4 to 6, 3, 7 to 10 , 4, II to 14, 5, IS to 18, 6, 
etc An intermediate set of weights are necessary if either per- 
centage is between 3 and 7 or 93 and 97 Use of this table possibly 
introduces a slight error in some cases but the gieat saving in time 
makes possible the calculation of many more data ” 

Strong’s (43) method, then, for determining a scoring key for- 
personnel managers, for example, is as follows 

1. The peicentages of personnel managers who like, are in- 
different to, and dislike each of the 263 items are obtained. 

Item L D 

Actor 49% 38% 13% 

2. The percentages of men from eighteen occupations who like, 
are indifferent to and dislike each of the items were obtained. 

Item L ^ D 

Actor 38% 35% 27% 

3. The differences between these two sets of figures are ob- 
tained. 
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Item L ? D 

ALlor + II +3 — 14 

4. These figures are replaced by smaller figures according to 
the scheme described on page 249 as follows 

Item L ^ D 

Actor +2 — 3 

5 An individual is scored by the use of this key. 

Strong (44) has recently (1927) thoroughly revised and ex- 
tended this questionnaire, now calling it “Vocational Interest 
Blank ” The experience with the questionnaire to date leads 
Strong to maintain that “members of each occupational group 
have characteristic likes and dislikes which distinguish them from 
other occupational groups.” Such evidence as is at hand indicates 
that “these vocational interests are present in college freshmen 
and are not materially altered by technical training and subse- 
quent professional experience ” 

The possibility that this same type of interest questionnaire 
might prove to be valuable m helping boys to discover their in- 
terests when a first choice of curriculum has to be made in the 
high school led Garretson (32) to assemble a set of items for a 
tryout in three New York City high schools — ^the De Witt Clinton 
High School (an academic high school), the High School of Com- 
merce, and the Brooklyn Technical High School. His original 
questionnaire has been revised and at present consists of nine 
sections as follows. 


Table 41 

Garretson’s Interest Questionnaire* (Sections and Numbers of Items) 


Number 

Section of Items 

1 Occupations . . 80 

2 Activities , 30 

3 School subjeas 20 

4 Job activities 20 

5 a School paper 15 

b Football team 18 

c Student activities 18 

6 Prominent men . . 12 

7 Things to own 26 

8 Magazines 24 

Total . ■* 263 


• Published by the Bureau of Publications, Teachers College, 1930 
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In the scoring for Garretson's revised questionnaire weights of 
only +, o, and — aie to be used for each response according 
as It reliably indicates preference for, neutrality toward, or dislike 
of, any particular curriculum, because Garretson found in the 
course of his work that this simple method yields results as satis- 
factory as a more elaborate weighting system. Plus items are 
given a weight of 2, neutral items, i; and negative items, o. In 
scoring, the number of positive items is counted and multiplied by 
2. The number of negative items is counted, added to the number 
of positive Items, and the total subtracted from 263 to give the 
number of neutial items, which, since they are one each, are 
added to the value of the positive items for the total score. 

In printing the questionnaire, the three possible responses to 

L 

each Item are printed in a column I so as to make the scoring 

D 

procedure simpler. 

Table 42 

Distributioks for Three Schools in Questionnaire Scored for 
Commercial Interest* 


Brooklyn De Wftt Chnton 
Technical High High School High School of 


Score 

460-479 

440-459 



School 

[dcademic) 

Commerce 

S 

19 

420-439 

400-419 




1 

I 

9 

Z3 

45 

380-399 

. . 


6 

20 

53 

360-379 , . . 

. . . 


8 

33 

55 

340-359 

« 


15 

SZ 

69 

320-339 

. . . . 

. . 

49 

69 

65 

300-319 



52 

60 

47 

280-299 

. . . • 


73 

89 

29 

260-279 

. 


83 

64 

27 

240-259 . . 

. . 


86 

54 

IX 

220-239 



85 

Z3 

6 

200-219 

. . E. 


Sz 

16 

2 

180-199 

• . . . 


31 

10 

• 

160-179 . . . . 




14 

3 

. 

140-159 

.... 


I 

-- 

I 

120-139 . 



•• 

• - 

- 

IOO-II9 . .. 

. .... 


• • 

• , 

• 

80- 99 . ... 




. . 

« • 

. 

60- 79 



. 

. . 

• 

Total 

. . . 





Mean 

a 



26449 

47 34 

30021 

*9 59 

35063 

55 03 


*From Symonds, P M, Tests and Interest Questionnaires in the Guidance 
of High School Boys (Teachers College, Bureau of Publications, 1930). 
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Reliability 

Hubbard (35) studied the leliability of the original Freyd 
blank and found it to be low and unsatisfactory as follows 

ABLe 43 

Reliability of Freyd Interest Blank 
(from Hubbard) 

Men TFoinen 

Time betaeen tests r N r N 

Six nceks S 2 (28s) 47 {313) 

One j'car 64 (156) 49 (193) 

Pei haps this low reliability may be explained in part by the 
scoiing scheme, which utilized only a small fraction of the pos- 
sible responses that might differentiate. 

However, when the results are portrayed so as to show the 
amount of shifts from one division of the scale to another, there 
appears a considerable amount of stability, particularly among 
those who obtain extreme ratings. 

Table 44 

Showing Changes in Score from Original Test (Freyd’s Interest Analysis 
Blank) to Retest 
(from Hubbard, 35, p 621) 

FER CENT OF CHANCE 

136 285 

Year Sve weeks' 
interval interval 

56 58 remained — z or less 

26 14 moved to between — i and 

+ I 

18 28 moved up to -h 2 or more 

100 100 

63 Remained + 2 or more 

2$ 25 moved to between -h 1 

and — 1 

8 II moved to — 2 or less 


Of those who were + 2 or 
more . . . .... 

Of those who were + 2 or 
more 

Of those who were + 2 or 
more 


Number of cases 


Of those who were — 2 or 
less 

Of those who were — 2 or 
less 

Of those who were — 2 or 
less 


100 


100 
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Cowdery (25) reports the following coefficients of reliability 
which were obtained fiom random halves of the questions, cor- 
rected by the Speaiman-Brown formula. 

Table 4S 

Reliability CoErnciENTS of Vocitional Interest Blank 
( from Cowdery, 25, p 138) 



Legal Engineering 

Medical 



scale 

scale 

scale 

Average 

Experienced men . . 

90s 

802 

77S 

827 

Graduate students 

8 (.6 

831 

778 

818 

Upper division uiincrsity students 

813 

786 

79+ 

798 

Lower division university students 

782 

661 

685 

709 

High school senior bos's 

00 

844 

697 

798 


These are fairly respectable reliability coefficients Further work 
must be done to harmonize these findings with those of Hubbard. 
Garretson reports reliability coefficients obtained by correlating 
odd responses with even responses of .911 for technical prefer- 
ence, 861 for commercial preferences, and .756 for academic pre- 
ference, which aie raised to .953, 955, and .861 by the Spearman- 
Brown formula for the whole questionnaire 

Validity 

The Carnegie Institute of Technology questionnaire has been 
tried out only in connection with vocational differentiation and has 
been found to yield remarkably promising results. Strong (44) 
has the records from eighteen occupations and is able to deter- 
mine (a) the scores which members of any one profession made 
on scales for other occupations and (b) the scores which members 
of different occupations made on the scale of any one profession 
Strong has used three letters A, B, and C to describe the position 
of a man on any scale. A means a score attained by 75 per cent 
of the criterion group, B means a score made by the lowest 25 
per cent of the men for whom the scale was developed, and C 
means a score lower than that made by any man in the given 
occupation 

Strong (47) later found that members of many occupational 
groups rate as executives to a surprising degree, the overlapping 
in the case of the executive interest scale is three to four times 
that obtained from the other seven occupational interest scales. 
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Table 46 

Percentages or Men in Variols Occupations Who Rate A and B in the 
Interest of Certified Public Accountants, Engineers, and 
Personnel Managers 


(from Strong, 47, p 233) 



S^^de for 



Scale for 


certifitd ptibhe 

Scale for 

personnel 


accountants 

engineers 

managers 


A 

B 

A 

B 

A 

B 

Certified public accountant 

74 

25 

2 

2S 

3 

31 

Banker 

20 

39 

2 

31 

0 

12 

Office worker 

2 

39 

7 

25 

2 

24 

Lau ycr . ... 

s 

23 

5 

25 

7 

40 

Engineer 

2 

21 

75 

25 

3 

31 

Peisonnel manager 

6 

13 

6 

33 

75 

25 

Author 

2 

16 

4 

18 

2 

10 

School-teacher 

. 2 

IS 

8 

29 

4 

31 

Life insurance salesman 

4 

12 

0 

29 

5 

30 

Adveitising man 

. 0 

10 

• 

. 

6 

38 

Doctor 

.. 0 

8 

9 

42 



Minister . . 

.. 0 

0 

0 

10 

2 

34 

Artist 

.. 0 

0 

4 

29 

0 

0 

Line executive . . . 

. • • • 

• • 

» 

. » 

7 

32 

Department store salesman 

• • • • 

• • 

• • 

• 

0 

14 


Table 47 

Correlations between Scores Obtained by Certified Public Accountants, 
Engineers, and Personnel Managers in Their Own and Other 
Occupational Interests 


(from Strong, 44, p 236) 



Certified public 


Personnel 

Scale 

accountant 

Engineer 

manager 

Certified public accountant 

1 00 

— 21 

10 

Line executive 

31 

37 

IS 

Lawyer 

19 

— 36 

47 

Banker . ... 

. .. 16 

— og 

— 39 

Advertising man 

.... 13 

08 

30 

Engineer 

13 

1 X 30 

s-36 

Office worker 

.12 

—.26 

26 

Author . 

J06 

06 

14 

Artist ... ... 

03 

— 01 

— 23 

Life insurance salesman . 

.01 

— 08 

.31 

Personnel manager 

00 

— 30 

1 00 

Minister 

. . . —.14 

— 52 

— 10 

School-teacher 

— 2S 

— SI 

00 

Doctor . . . 

—30 

+.21 

. 


Department store salesman , ~ 7.04 
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Strong says that possibly among various occupations there are 
to be found a large percentage of potential executives 

Cowdeiy (25) has demonstrated that scales developed from the 
interest questionnaire correlate about zero with intelligence scores 
“Significant relations with intelligence test scoies are lacking, a 
slight positive relation with Thorndike test score was noted in 
the case of the respective interest latings of law and engineering 
students ” 

Cowdery reports a conelation of 335 between engineering in- 
terest scores and university giades m engineering subjects He 
also found that “freshmen and sophomores planning to be en- 
gineers, juniors and seniors in engineering school, engineers in 
graduate work, engineers with less than five years’ practical ex- 
perience score approximately the same on the same mteiest test 
The same holds true with respect to physicians and lawyers ” 
This evidence is presumptive that those preferences are not tran- 
sient affairs but represent rathei permanent turns in the individual 
interests It also suggests that such interests do not arise from 
training, but rather tend to select people for various vocations. 

Garretson (32) found that his questionnaire differentiates in a 
remarkable way between the intciests of one curriculum group 
and another Biserial r’s between the distribution of scores in one 
curriculum and another are given in the following table These 
relationships were found on diffeient groups from those from 
which the scoring keys were obtained. 


Tible 48 

Reliability Coefficients of G\RRErsoN Interest Questionnure for High 

School Sfudenfs 


Questionnaire 
Sconnff Key 

Gai rctson’s Data 

Syinonds’ Data 

Cotnwer 

eial against 
other two 

Technical 

against 
other t7w 

Academic 

against 
other two 


Tl ehiiical 
Cominci- 
cial 

Academic- 

Commer- 

cial 

Commercial 

.727 



—433 

— 809 

— S44 

I'echnical .... 


868 


730 

706 

197 

Academic . . . 



560 

— 637 

— 274 

433 


The relationships as expressed by these bisenal r’s are higher 
than are commonly found between intelligence tests and later 
measures of achievement In other words, this questionnaire pre- 
dicts considerably more accurately the cilrnculum a boy will 
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choose than an intelligence test will predict his success in that 
LUiiiculurii. 

This appioach to the measuiement of interests promises to be 
very valuable to both educational and vocational guidance Part 
of the success of the method lies in its application of the sampling 
theoiy. It is well iccognized that a single statement of a boy’s 
choice of occupation or other interest is comparatively unreliable 
A u eek later the boy may have changed his mind But the cumu- 
lative evidence fiom a hundicd oi so questions indicates reliably 
the direction of interest along one oi moie bioad general lines 
Intciest questionnaires seem to be piomising, not because they 
will answer questions as to choice of specific curricula or occupa- 
tions, but because they point out certain broad trends of interest 
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Chapter VIII 

TESTS OF CONDUCT KNOWLEDGE AND JUDGMENT 

I N a discussion of methods of measuring conduct, tests of 
knowledge and judgment certainly should not be neglected 
Just what relation they have to conduct will appear in the 
sequel Without anticipating our discussion, we may say that 
It seems only natural that men should have turned to tests of 
knowledge and leasoning for the diagnosis of conduct Even the 
popular belief that action follows knowledge, that we reason out 
beforehand the course of action we are to pursue, gives warrant 
enough to investigators to experiment with tests of this type. This 
belief permeates our institutions, our educational theories and 
piactices, and indeed all our iclations uith our fellows Courts of 
criminal law decide responsibility for an act on the basis of the 
defendant's ability to discriminate betucen light and wiong 
In the following discussion different types of tests will be de- 
sciibed, some of the better tests in detail Following this, coriela- 
tions will be reported which show exactly what relations the 
abilities shown bv the tests have to measures of conduct and in- 
telligence Finally, the significance of the tests and the bearing of 
knowledge on conduct will be discussed. 

Tests of the Vocabulary of Conduct 

Fundamental to tests of knowledge and discrimination is ability 
to understand the vocabulary of the field of conduct undei con- 
sideration There arc few vocabulary tests in special fields, and 
none is available which coveis conduct m the fields of health, the 
use of language, the handling of property, etc. Good vocabulary 
tests have been constructed in the social-ethical field by Kohs (34) 
and Schwesmger (40), and a special test of slang by Schwesmger 
(39) IS available Kohs’ vocabulary test (number 4 of his battery 
called “Ethical Discrimination Test” *) is in the multiple-response 
form and consists of forty-five items, of which samples follow. 

' Pulilishcd m 1922 andjsold by C H Stoclting Co, Chicago 
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(a) bad means clean, wiong, both, good 

(b) rc\enge iiieaiisj sheep, beai with, treat, get even. 

(c) guilt means offense, guillotine, sword, golden 

By far the best piece of work m this field is the social-ethical 
vocabulaiy study of Schwesinger, conducted under the direction 
of the Character Education Inquiry This special vocabulaiy com- 
prises the words commonly used to describe situations involving 
human relations (.joke, company), teims used in deciding moral 
issues (illegal, villain), adjectives which denote modifications of 
character (bashful, lecalcitrant), abstract nouns indicative of 
states of mind and character tiaits (uncertainty, snobbishness), 
and terbs indicating behavior of human beings towards each 
other (scoffing, pitying) A few “shady"’ woids consisting of slang 
and professional ciook teims wcie included Thorndike’s “Tests 
of W'^ord Know ledge” w ei e accepted as a model, and 1,000 vocabu- 
laiy Items were constructed Seven hundied of these are for test- 
ing w'ords which occur in the fiist 10,000 of common usage as 
defined by the “Thorndike Word List” These 1,000 words were 
broken up into five gioups of 200 each The five tests thus con- 
structed were given to school-children, and an elaborate item 
analysis followed Three hundred most “symptomatic” items 
were separated into tw'o forms of 150 woids each, made equal in 
difficulty on the basis of the earlier testing These tests, when given 
to children in grades five through eight, showed very high relia- 
bility Seventy-five items in Foim A correlated with the other 
seventy-five 966 The same correlation in Form B was 961. 
Corrected so as to apply to the whole test the correlation becomes 
.98 Discussions of the significance of the test will be given later. 

Schwesinger (39) had also constructed earlier a test of the vo- 
cabulary of slang which she hoped would have some significance 
with relation to charactei and would help separate delinquents 
from non-delinquents The test has four sections as follows (39, 
p. 254 f.): 

I. Definitions — 25 items. 

Samples 

Simp means dumbbell, simpleton, sink, scab. 

' Ivory Dome means cupola, bonehead, elephant’s tusk, 
foolish. 

Iron Men means crusaders, dollars, bones, rubes. 

Sheeney means Silky, Kike, Jips, Jew. 

To be Stewed means dru^, cooked,® tanked, batty. 
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II Phrases — ^<<8 items. 

Samples 

Let George Do It means 
Pass the buck 

Avoid accepting responsibility 
Let Geoige try if he wants to 
To get away with it 
He Has A Screw Loose means 
He needs to be mended 
He isn’t quite sane 
He is a hunk o’ cheese 
He IS dippy 

III Same-Opposites — 50 items. 


Samples 


cold 

hot 

Same 

Opposite 

dragged out 

all-in 

Same 

Opposite 

stall 

jostler 

Same 

Opposite 

razz 

tease 

Same 

Opposite 

hand-out 

food 

Same 

Opposite 


IV. Classification — 18 lines, 124 items. 

Samples 

Write the letter S under every word that is used in Smok- 
ing. 

Write the letter G under every word that is used in 
Gambling or Cards. 

Write the letter D under every woid that has something to 
do with Dunking ^ 

Write the letter T under every woid connected with some 
form of Thieving. 

Write the letter F under every word that has to do with 
Fighting. 

Write the letter A under every word that has to do with 
Getting Arrested 

Write the letter W under every w'oid that has to do with 
Woman 

Write the letter C under every word that has to do with 
Craps. 

1. nicked, moll, four-flushing, jimmying, sand-bag, whop- 
per, salt-creek. 

2. deal, dead-head, switched, booze, skirt. Little Joe, 
moll-buzzer. 

3. slacker, nicotine, pulled, kick, scalawag, dive, rough- 
house, flif^ver. 
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Tests of Knowledge of Conduct 

Tests have been made of the knowledge of conduct in the field 
of health by Gates and Strang (22) and in the field of manners 
by Orr (27). In other phases of conduct, knowledge tests have 
not been developed. This is probably indicative of the fact that 
direct knowledge of desirable conduct is not a subject of in- 
struction. In most phases of conduct society is probably not quite 
sure enough of the validity of its mores to codify them and test 
the knowledge of the codes. 

The “Gates-Strang Test in Health Knowledge” is a very 
fine contribution to the growing collection of standardized tests. 
It IS based on a very extensive and thorough analysis and inven- 
tory of facts and principles of hygiene found in twenty selected 
courses in health offered in rural and city schools in America, and 
in fourteen of the most widely used textbooks on health These 
facts and principles were judged for their validity by experts. 
Seven hundred and forty-four exercises in the multiple-choice 
form were constructed and given to a large number of pupils in 
elementary schools, high schools, and colleges. Of these 224 were 
eliminated for such reasons, accoidmg to the authors, as, “They 
were essentially duplicates of others, no scientifically correct an- 
swer could be made at the present time; the results from testing 
showed ambiguous, misleading or confusing statements, the ex- 
perts thought the idea too trivial or too technical to Include ” 

The result of the research is a list of 520 items classified under 
topical headings and arranged from easiest to hardest in each 
group, with difficulty values appended From this list a teacher 
may select items to form a test of any length, level of difficulty, 
or range of difficulty, or on any topic or combination of topics 
For those who wish printed tests, short forms of sixty-four items 
selected from various topics and arranged according to difficulty 
have been prepared * 

Samples of the exercises are given below: 

I. We should have fresh air 
— —all the time 

m the daytime but not at night 

at night but not during the daytime 

especially in summer 

when we begin to get a headache. 

•Published by the Teachers College Bureau of Publications. 
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2 Bojs and gnis should biush their teeth 

twice a vcai 

once a month 

t\Mcc a day 

once a tt cek 

twice a week. 

3. Keeping school desks neat and floor free from papers should 

be done by 

the teacher 

the jamtoi 

the piincipal 

the pupils 

the paients. 

Miss C. I Oir, working for the Character Education Inquiry, 
has made a “Test of Good Manners ” This is an information 
tc''t of the curient standards of courtesy and good manners, using 
thiee types of tests — the true-false, the multiple-choice and the 
single-answer Samples of the items follow (27, p. 199) . 

Tiue-Fahe 

In helping yourself to sugar always use your 
own spoon True FaUe 

When yaw ning, make no attempt to suppress 
it by covering the mouth. True False 

Multiple-Choice 

Approval of a piogiam may be shown by 
I. Stamping feet 
2 Clapping 
3. Whistling 

Yes-No 

Should a man tip his hat to a strange lady 
when picking up an article which she has dropped'* Yes No 

If a door is closed, is it necessary to knock before 
entering a friend’s room? Yes No 

Tests of Biblical knowledge. It seems a far cry to include 
tests on the Bible in a discussion of the diagnosis of conduct, and 
one must understand the history of the movement to comprehend 
the reason for the relationship between the Bible and conduct 
The reasoning involved has never been very direct On the one 
hand, since the Protestant Reformation made religion depend not 
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on ihc aullioiily of the church but on the authority of the Woid 
as It IS printed in the Bible, it became extremely impoitant that 
Protestants should know how to icad the Bible as the guide to 
salvation The zeal shown by some cailj missionaiics in teaching 
the heathen to read was foi the sole puipose of enabling them to 
lead the IIol) Sciiptures On the othei hand, Chiistianity is in- 
tcipietcd to mean a way of Inung, moies, a set of customs In 
chuich school mstiuction, both the Bible and the Chiistian virtues 
and standard of living aie taught It is little wondei that some 
relationship between the two should be sought This relationship 
has been lationalized by asserting that the Bible is filled with 
all kinds of social and ethical relationships and that the teachings 
of Jesus aie the bases for the Christian standard of living Ac- 
coidingly, in some quarters, ethical mstiuction and Bible in- 
struction aie synonymous In view of this fai-icaching belief, it 
certainly behooves us to include tests of Bible knowledge in our 
review so that the exact relation betw'een Bible knowledge and 
conduct may be detei mined 

The first of these “Biblical Knowledge Tests” appealing in 
1920 was prepared by M T Whitley* and was designated the 
Old Testament, Series A This test was intended for children 
nine years old and upward who are above the primary depart- 
ment of the Sunday-school. There are five tests. 

Number 


Test no. Title Form of item of 

items 

I Relationships and Loca- Multiple-choice 30 

tion by Books 

II Source of Quotations Single-answer 

matching 10 

III Order of Bible Books Two alternatives 

multiple-choice . ■ . IS 

IV History Facts Single-answ^er . 40 

V. Completed Quotations Completion ii 


This test may be criticized as testing a very superficial knowl- 
edge of the Bible because it is mainly concerned with its mechani- 
cal make-up, “historical” facts, and recognition of quotations. It 
fails entirely to test knowledge of the teachings, point of view, or 

• Published by M T Whitley, Teachers College, Columbia University, New 
York. City 
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development of the Bible It represents the traditional objectives 
of Sunday-school teaching 

Dr Whitley has since published a New Testament test of the 
Bible 

Anothei test also published in 1920 was by J T Giles (23, 24) 
This was a simple test consisting of three parts in the true-false 
form. The fiist twenty-five quesUons cover information on the 
Old Testament, the second twenty-five questions cover informa- 
tion on the New Testament, and the last twenty-five are questions 
involving ethical judgment 

Giles’ test was revised in 1924 by W L Hanson, who turned 
all Items into the best-answei or multiple-choice form and m^de 
some other minor changes Tvto forms are available. 

A test supeiior to those so far mentioned was constructed by 
S R Laycock — ^the “Laycock Test of Biblical Information ” * 
The test comes in seven parts as follows 


-Vo of No of 

test Subject of test Type of item items 

I General information Multiple-choice 15 

II Knowledge of what cer- 

tain passages contain Multiple-choice 8 

III General information True-false 30 

IV Knowledge of what cer- 

tain passages teach Multiple-choice 8 

V. General information . Multiple-choice ij 

VI. Miscellaneous knowledge 

Items Multiple-choice 8 

VII. General information Multiple-choice 16 


too 


The distinction between the parts is not clear-cut. Probably 
there is an advantage m splitting up the test into separate sub- 
tests in order to break the monotony of a long, unbroken string 
of Items for the younger pupils. But the separate parts seem 
to contain no special diagnostic significance. 

An “Advanced Bible Knowledge Test” by G. B. Watson and 
Eliot Porter is published as one of the series of “Character and 
Peisonality Tests” of the Association Press. It is composed as 
follows . 


* T ublished in 1922 and for sale by the University of Alberta Bookstore. 
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No of 


No. of 

part Subject of test 

Type of Item 

items 

la Old Testament 

Multiple-choice 

.. 17 

Ib New Testament 

Multiple-choice 

28 

II Brief outline of the con- 

Completion ... . 

... so 

tents of the Bible 

9 S 


Tnis test involves a more highly developed critical ability than 
the simpler information tests previously mentioned 

Besides these tests which deal specifically with Bible infor- 
mation there are other tests dealing with religious ideas and 
judgment. 

The “Chassell Test of Religious Ideas Involving the Ranking 
of Selected Answers” by C. F. Chassell-Cooper and L M Chas- 
sell-Toops (14) provides a check on the pupil’s religious ideas 
The following questions are asked* 

I. What IS the purpose of the Old Testament^ 

2 What IS the purpose of the New Testament? 

3 How do you think of God? 

4 How would you describe God? 

5 What does God do^ 

6 How do you think of Jesus ^ 

7. Why should we pray? 

Following each of these questions is a list of several possible 
answers, and the pupil is instiucted to choose the three best 
answers, marking them i, 2, and 3 according to first, second, and 
thud choice. 

There are also tests of the comprehension of proverbs, fables, 
and parables Otis included a proveibs test in his original intel- 
ligence examination. In this test one has to match proverbs and 
matter-of-fact statements which might explain them Kohs (34) 
used a similar test m his “Ethical Discrimination Test.” Kohs, 
however, used the multiple-choice form, as in the following ex- 
ample. 

People who live in glass houses must not throw stones, means 
( ) Do not put all your eggs m one basket 

( ) Those who have faults should not criticize others 

( ) An hour may destroy what it has taken years to build. 

In the “Binet-Simon Intelligence Scale” there is a fables test. 
In the “Stanford Revision” the following fables are used “Her- 
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cules and the Wagoner,” “The Milkmaid and Her Plans,” “The 
Fox and the Ciow,” “The Farmer and the Stork,” “The Millci, 
His Son, and the Donkey.” After the fable is read to a child, he 
is asked, “What lesson does that teach US'”’ Lowe and Shimbeig 
(35) tried out this test as a moial judgments test. 

The “Drew Tests in Religious Education” by C F Chassell- 
Cooper (12) repiesent one effort in this direction The first of 
these is a test of the ability to interpret ten parables, viz , “The 
Lost Sheep,” “The Lost Coin,” “The Prodigal Son,” “The Good 
Samaritan,” “The Sower,” “The Ten Virgins,” “The Rich Fool,” 
“The Rich Man and Lazarus,” “The Unmerciful Servant,” “The 
Pounds ” Multiple-choice questions are asked about each paiable 
There are also self-rating and teacher-rating charts as part of 
these tests. 


Moral Judgment Tests 

The “moral judgment” or “ethical perception” test has had a 
long history Staiting, peihaps it may be said, with the com- 
prehensive work of Sharp, this type of test has suggested the 
efforts of various woikers, including Fernald, Haines, Healy, 
Watson, and otheis Foimerly the essay type of examination 
question was asked, but moie recently the whole movement has 
had a renaissance, using the newer objective testing techniques. 
Of the older tests, that by Fernald (20) has had the greatest 
use and publicity His method has been experimented with con- 
siderably in Geimany. The illustration given heie is taken fiom 
Sharp’s earlier work (41) It is interesting to note that Sharp 
used his tests to help throw light on the issue as to whether we 
solve a moral situation by means of geneial principles or by 
noting the effects of a course of action, an issue which is still 
unsolved and acute to-day Sharp used such questions as the 
following (41, p 202) 

(a) In a small Western village a switchman was just about 
to turn the switch for an approaching express tram when he saw 
his little son, his only child, playing upon the track The choice 
had to be made between the life of the babe and the lives of 
the passengers. What ought he to have done’ (b) In the case 
just cited the man was on duty. What should be the decision 
under the following circumstances^ 

A drunken switchman has left the switch open. A man who 
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lives near the tiacks notices the green switch on his way home 
fiom woik and is just about to turn it to sa\e the train, when 
he sees his only child upon the tiack just in fiont of the engine. 
The alternative is as in (a) 

With the advent of the objective testing movement, tests of 
moral knowledge or judgment or discrimination employ questions 
that aie simpler and more objective Since the test items are now 
shorter and simpler, it is impossible to set up such a heait- 
rending moral dilemma as is illustrated m Shaip’s question The 
moial situations in the new-type tests are simpler Since many 
moic Items can be given in the same testing tune than were em- 
ployed for the old examinations, it is believed that ability to 
make moral discriminations is bettei tested by the wider sam- 
pling of pioblcms, individually not so neive-iacking A similar 
change in point of view has taken place in the testing of school 
subjects To-day, instead of an examination consisting of difficult 
problems in algebia, several shortci and easier questions are 
used in place of each difficult one, with even bettci icsult 

Still another difference in the newer testing techniques is the 
emphasis on situations that actually enter into a child’s experi- 
ence Sharp’s situations were possible but imaginary. The fol- 
lowing questions devised by Hartshorne and May show the new 
point of view (47, p 17). 

I. If anothci pupil ivants to copy youi noik and hand it in, 
a Let him do it and say nothing about it 
b Let him do it and tell the teacher, 
c Don’t let him do it and say nothing 

d. Don’t let him do it and tell the teacher he wanted to. 

e. Don’t let him do it and tell him you disapprove of cheating. 

5 . If you sec bioken glass in the street, 
a Pick It up 
b Do nothing about it 

c. Tell the policeman about it. 

d. Try to find the one who did it. 

Lastly the newer type of test uses recognition rather than re- 
call. Instead of asking the pupil to put in writing what he would 
do, It gives several possible answers out of which the pupil 
chooses the one he thinks best. This has the advantages of ob- 
jectivity in scoring and amenability to statistical treatment. The 
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older studies using essay-type tests were handicapped m that the 
answers were so varied as to pieclude classification and statistical 
treatment Usually some rough grouping was effected, but the 
results weie subjective and not very satisfactory What the newer 
tests lack in permitting free expression they more than compen- 
sate for by greater objectivity and reliability. 

By far the best tests of this type have been made by Hart- 
shorne and May in their work on the Character Education In- 
quiry These tests will be described in some detail and will illus- 
trate the various t}'pes used by other authors. The form of the 
test may vary however 

These authors divided their tests into three groups: 

A Word Tests 
B Sentence Tests 

C. Good Manners Test (already described on page 264) 

The word tests are four in number, namely, an Opposites Test 
in the multiple-choice form, a Similarities or Cross-Out Test, 
a Word Consequences Test, in which the subject indicates (a) all 
likely consequences that might follow from the action represented 
by the word in capitals, (b) the most likely consequence, (c) 
the best consequence, and (d) the wont consequence; and the 
“Schwesinger Ethical-Social Vocabulary Test.” 

The Word Consequences Test is important for measuring the 
ability to judge the outcomes of certain situations or actions. 
Franzen m his senes of “Health Education Tests” has used the 
matching device to achieve the same result, as shown in the fol- 
lowing sample. 

( ) Wet feet 
( ) Bad cold 

( ) Bedroom 

( ) Garbage pail 

( ) Flies 

( ) Sore throat 
( ) Babies’ milk 
( ) Sick people 
( ) Whiskey 

( ) Dirty dishes 

Reprinted by permission of the American Child Health Association, New 
York City 


I. Keep from breeding 

2 Should not touch other people’s food 

3 Blow the nose gently, not hard 

4 Keep covered 

5. Scald with boiling water. 

6. Should be very clean. 

7. Should not be too warm. 
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The sentence tests of Group B in the Hartshoi ne-May series 
particularly intcicst us hcic 

Cause and effect. This is a tiuc-falsc test of 100 items of uhich 
the following aie samples (29, p 6) 

Some of the statements made below are true and some are 
false Read each statement caiefully and undeilinc the word 
TRUE if It seems to you to be tiue Underline the word false 
if It seems to you to be false 

1 Good maiks are chiefly a matter of luck True False 

2 Ministers’ sons and deacons’ daughters usually 

go wrong True False 

3 If one eats stolen apples he will have a 

stomach-ache True False 

This test IS an endeavor to determine how correctly people can 
reason about cause-eflFcct relations in everyday life The investi- 
gators used the judgments of a graduate class in the psychology 
of character w'ho answered the items to deteimine the key A 
7S per cent agreement of this group was necessary before the 
item was included m the test, with the exception that in one or 
two cases either ignorance or conventional opinion seemed to 
determine the answers In these cases the majority vote of the 
class determined the key, but m one or two cases the class de- 
cision was even revised. 

Duties. This is a yes-no test In it the pupil is asked to state 
whether ceitain acts arc his duty, are not his duty, or are some- 
times his duty and sometimes not (29, p 7) . 

I. To help a slow or dull child with his lessons Yes ? No 

2 To read the newspapers every day Yes ? No 

3 To call your teacher’s attention to the fact if 

you leceived a higher grade than you de- 
served Yes ? No 

In this test again the question of scoring was difhcult The 
j'udgment of the graduate class was even more split than m the 
previous case and often went against the judgment of the in- 
vestigators. The majority opinion of the children might also serve 
as a basis for a key, but in that case it would differ from the 
key prepared by adults. The preparation of a key for this type 
of test strongly emphasizes the degree to which these are matters 
of convention and code. 
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Franzcn (21) uses two other devices to test “duties.” One he 
calls a “Five Rules Test” Ten acts of more or less importance 
aic given in a column The diiections are “In this column mark 
nith an (X) nvn rules you would like to follow foi one month In 
this column maik ri\E lules and no more” Since Franzcn has 
interpolated some duties that arc not specially related to health, 
his test IS also a measure of something like interest in health 
01 the lelatne importance with which health is held 

Franzen’s other test which belongs in this group is called “Time 
Test” Twent>-two health acts are listed. The directions state. 

(A) Mark w’lth an (A) the things in this list you should always 

DO ONCE A DAY OR MORE 

(B) Maik with a (B) the things in this list you should do 

ONCE A WEEK, ONCE A MONTH, OR MORE OFTEN. 

(C) Mark with a (C) those things in this list you should do 

ONCE OR TWICE A YEAR 

( ) Do NOT mark those things that you do not need to do * 

Comprehensions. This is a multiple-choice test of the “what 
should one do” type 

Examples (29, p. 8) . 

1 If some one asks to borrow your pencil, 

(a) Tell him it’s broken 

(b) Tell him that you just lost it 

(c) Tell him that you don’t want to loan it 

(d) Let him take it. 

2 If some one steals your lunch, 

(a) Steal another lunch to even it up 

(b) Report it to the teacher 

(c) Cry about it 

(d) Say nothing about it. 

Provocations. This is a more complicated test which attempts 
to determine the extent to which judgment has the ascendancy 
over wishes, prejudices, emotions, conventions, and the like. 

Example (29, p. 8) : 

Here are some little stories of what some children did You 
are to decide whether they did right or wrong. If what they did 

' Reprinted by permission of the American Child Health Association, New 
York City- - o -• - , . 
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was not quite light, perhaps it was at least excusable m view of 
the ciicumstances. Look at the sample fiist 

Sample Jane’s family vas too pooi to buy fiuit for her sick 
biother. So every now and then Jane took an apple or an orange 
fiom a fruit stand and brought it home to him. 


Now if you think she was absolutely wrong in 
taking the fruit, put a circle aiound the Wr, 
like this 

But if she did exactly right, encircle the R, 
like this 

If you think she was wrong, but excusable in 
view of hei desiie to bung it to her sick 
brother, encircle the Ex like this 


R Ex 
Ex Wr 

R Wr 


Begin heie and do the lest in the same way. 


I. Helen noticed that nearly every one in the 
class was cheating on a test so she cheated 
too 

2 Harry was a Christian boy. One day a Jewish 
boy called Harry “a dirty Christian.” Harry 
knocked him down 

3. Chailes did not want to play marbles for 
keeps but the boys called him a “sissy” so 
he went ahead and played for keeps any- 
way. 


R 

Ex 

Wr 

R 

Ex 

Wr 


R Ex Wr 


In this test it was found that the responses of the graduate 
class tended to be so highly conventional — much like what the 
sixth-grade childien answeied — as to be worthless. For a scoring 
key the exainineis used answers w'hich agreed with “a standard 
that w'ould conform to the gieat historical moral ideals” 

Foresights. This test consists of a number of desciiptions of 
situations. The subject is merely requested to write down as 
many things as he can think of, good and bad, that might happen 
from the events recorded (29, p. 10): 

1. Whenever any one picked on John he would go tell his teacher. 

2. John accidentally broke a street lamp with a snow-ball. 

In a later edition this test is given in the multiple-choice form 
in which a number of possible consequences are listed and the 
pupil IS asked to rate each one according as “this is likely to 
happen”; “this might happen but not likely”; and “this w'ould 
not happen.” 
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This IS 

This 

This 

John accidentally broke a 

likely to 

might 

zoould 

stieet lamp zvith a 

happen 

happen 
but not 
likely 

nut 

happen 

snow-ball 

□ 

□ 

□ 

I. John was arrested and sen- 
tenced to SIX months in 
]ail 

□ 

□ 

□ 

2. John said nothing about it, 
and people thought an- 
other boy had done it 

□ 

□ 

□ 

3. The emergency wagon had 
to come and repair it 

□ 

□ 

□ 

4. He thought it was such fun 
that he smashed a lot 
more lamps 

□ 

□ 

□ 

5. There was an accident 
there because it was 
dark. 

□ 

□ 

□ 

6. Some people were cross 
about It and John’s 
father got into trouble 

□ 

□ 

□ 

7. The glass went on the 
street and a child cut his 
hands on it 

□ 

□ 

□ 

8 The city had to pay for 
the lamp. 


Recognitions. This is a test of ability to classify acts under cer- 
tain headings. The directions state (29, p. 10) : 

After each statement are five letters, C L S X J. If the deed 
is a case of Cheating, draw a circle aiound the C, if it is Lying, 
around the L, if it is Stealing, around the S If it is something 
wiong, but not either cheating, lying, or stealing, put a circle 
aiound the X If it is not wrong at all, put a circle around the J 
If the thing is both cheating and lying or stealing and lying, or 
all three, encircle all the letters you need to in order to express 
your opinion. 

1 Bullying younger children C L S X J 

2 Using street-car transfers that are out of 

date C L S X J 

3 Riding on the back of a truck without the 

driver’s knowing it C L S X J 

Principles. Whatever one’s belief or theory may be as to the 
nature of rational action, it is certain that on some occasions we 
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art tint on the basis of foiescen consequences, but on the basis 
of principles The next test m the C E I senes is a true-false 
test of knowledge of principles (29, p ii) 

1 To master oneself is a greater thing than to win 

a battle True False 

2 Clean speech is a sign of being a “goody- 

goody ” True False 

3. Obedience is of greater importance than honor. True False 

Applications. Next comes a test of the ability to apply these 
principles A situation is described which ends in a dilemma in 
conduct, requiiing the making of a choice Five principles are then 
given, two of which apply m the situation. These two are so 
chosen that one suggests the first alternative as the right one, and 
the other the second If a pupil has his decision already formed, 
he will not be so ready to recognize a principle which applies in 
favor of the opposite side of the issue. 

Example (29, p. 12) 

I Mary saw Helen cheating on an examination She had to 
decide whether she would 

( ) (a) Report it to the teacher. 

( ) (b) Not report it to the teacher 

Here are the five rules, of which two apply to this problem 
Check two and only two in the spaces at the left of the numbers 

( ) (i) Treat others as you would like to have them treat you 
( ) (2) Be true to what is for the good of all, even when 
your own interests or those of your friends are in- 
volved. 

( ) (3) When you have wronged some one, ask to be for- 

given. 

( ) (4) Be cheerful and uncomplaining when disappointed or 

hurt or in trouble 

( ) (s) Do not think of yourself as more important than you 

are 

After checking the two rules that apply to Mary, put a check 
before either (a) or (b), according as you think it would have 
been right for her to tell or not to tell. 

In later revisions a further test is given which carries this 
last test a step further Instead of asking whether or not the 
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principles apply, the pupil is asked to state which principles are 
most important Foi each incident the piinciples for and against 
aie listed sepaiately The test is headed “What Things Would 
Be Impoitant If They Weie to Happen ” 

In the test the pupil is to put a circle around all of the prin- 
ciples he thinks are important in the situation described Then 
he is to put an X befoie the two most impoitant principles, and 
finally a second X before the most important of all 

Example 

I John accidentally broke a street lamp with a snow-ball. 

First Ballot 

a The glass fell and hurt a man who was passing by, 
b His father scolded John for it 
c John blamed another boy for it. 

d It showed him that he had to be more careful when throw- 
ing snow-balls 

e The city had to pay for the lamp. 

Second Ballot 

f. The emergency wagon had to come and repair it 
g A policeman saw John do it and ran after him 
h. The glass went on the street and a child cut his hands 
on It. 

1 John said nothing and people thought another boy had 
done It 

j John paid for it 

Altogether these tests represent a rather complete analysis of 
the different phases of the thought process as applied to con- 
duct There are many theories of the relation of thought to con- 
duct On the one hand some believe that much or most of conduct 
operates without the aid, guidance, or intervention of thought. 
Others believe that thought acts through the application of gen- 
eral principles Still others believe that thought penetrates deeper, 
first sizing up the situation, afterwards considering what the out- 
comes may be, and then deciding in the light of these outcomes. 

The tests called “Duties,” “Comprehensions,” and “Provoca- 
tions” get at the knowledge of what one ought to do or judg- 
ment as to whether an act is right, wrong, or excusable, 
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To test out the control of conduct on the basis of general 
punciples seveial tests are available The “Recognitions” test dc- 
tei mines ability to classify acts under certain headings This is 
important if one is to be able to apply the principles to specific 
situations Then there is a test of knowledge of “Principles” and 
a test of the ability to “Apply” these pimciples in the concrete sit- 
uation Finally iheie is a test of ability to “Weigh” and “Choose” 
these principles according to their importance 

To test the ability to guide conduct by noting its outcomes 
or results three tests aie available. Two of these, the “Word 
Consequences” and the “Foicsights” tests, measuie the ability 
to estimate the piobable outcomes of acts or situations The 
“Cause and Effect” test is moic a test of looking back from a 
given event to the tiue and propei cause 

One other test seems necessary m this lattei process — a test 
of the ability to weigh consequences and make choices in the 
light of the decision Such a test has been piepaied by Chassell 
and Chassell (ii) in the “Test of Ability to Weigh Foreseen 
Consequences ” The test consists of a series of stories. After 
each story is given a list of ten or so possible consequences The 
directions state to “mark with a plus sign (-{-) all the conse- 
quences that seem to you desirable, and with a minus sign ( — ) 
all those that seem to you undesirable ” After this is done the 
pupil IS asked to make a decision in the light of the consequences 
and their importance 

Underline yes if the consequences marked plus seem to you 
the more impoitant, and no if those marked minus seem to you 
the more important 

YES . NO 

This test IS important, and strangely enough it supplements 
the C. E I.’s excellent battery of tests in an important way. The 
test as It stands may be criticized because the stories describing 
the incident to be judged are so lengthy. Reading ability, both 
speed and comprehension, certainly play a large part in deter- 
mining outcomes on the test. In the C. E. I. tests, all of the 
incidents are described in a very brief way. 

Another veiy interesting and suggestive test is the “Story Test” 
devised by Franzen as one of the “Health Education Test” series. 
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It is hard to classify this test as one of knowledge, interest, atti- 
tjde, judgment, or conduct A stoiy is read and the pupil is to 
note with a red pencil all the items relating to health, good or 
bad. Since the stones themselves are mteiestmg, providing a 
good deal to distract the attenUon from the health items, a child 
must overcome a ceifain amount of resistance to turn his atten- 
tion from the thread of the story to the matters of health Exactly 
what the test measures must be learned from the correlations. 

Example. * 

Directions 

Underline cveiythmg that is good for health. 

Cioss out everything that is bad for health. 

Use a red pencil. 

Two things are alieady marked correctly in this story. 


Jean Learns How to Play a New Game 

Jean ran down the long hall thiough the school toward the 
playgiound As she passed the drinking fountain she stopped for 
a long cool dnnk She prised i)lr l)^s cl)fee to the pipe so as 
not to waste a diop of the sparkling water. The girls in Jean's 
gymnasium class \>eie to learn to play basket-ball that aftei- 
noon, ana they were veiy much excited about it They played 
wi'ii the ball, tiying to throw it into the basket, until their time 
in the gymnasium w aS over When they w'ere leaving the teacher 
gave each one a book of lules to look over at home before they 
played again 

Several of Jean’s c'assmates lived near her and on their w'ay 
home fiom school they planned to plav basket-ball all after- 
noon One of die girls bci rowed a basket-ball from her brother 
and Jean’s brother fastened a baud hoop to a tall fence for 
them to use as a basket The giils had a w'onderful time and 
yelled and shouted until they were hoarse and very thirsty 

They all went into Jean's house to have a drink and counted 
out to see who should be first to use the pretty green glass that 
stood near the faucet. Jean’s mother gave each one three cookies 
and the girls sat on the porch to eat them and plan for other 
afternoons practising with the ball One of the girls thought her 
father might give her a basket-ball for her birthday. Jean’s 
mother had told them when she gave them the cookies that it 
was five o’clock so they could only stay a few minutes longer. 

After supper Jean took her book of basket-ball rules and sat 

‘^Reprinted by permission of the American Child Health Association, New 
York City 
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down to read in front of a lamp so that her book was m a bright 
light She studied them carefully and by eight o’clock she was 
sure she would be able to play the game the next time they went 
to the gymnasium 

Jean was very tired and was glad she didn’t have any more 
studying to do so that she >.ould go right to bed. 

Offense Rating 

In this group of tests of knowledge and judgment the ranking 
or rating of offenses should be included In the tests previously 
described, a single judgment was required on a variety of items, 
each one taken separately In the present technique a careful 
ranking of a small group of carefully chosen items is asked for 
instead The method of ranking or paired comparisons has had 
a long psychological history. Fernald was the first to suggest 
that this method be used in ranking offenses, so far as I am 
aware. 

Brogan (3, 4, 5), who has studied the method somewhat ex- 
tensively, asked students in the University of Texas to list the 
“ten worst practices” among students in the university Sixteen 
practices were found as occurring most frequently They are (in 
alphabetical order) Cheating, Dancing, Drinking, Extravagance, 
Gambling, Gossip, Idleness, Lying, Sabbath-Breaking, Selfish- 
ness, Sex Irregularity, Smoking, Snobbishness, Stealing, Swear- 
ing, Vulgar Talk Brogan has had these lists ranked many times 
by various groups according to the “worseness” of the practice, 
and according to the fiequency with which it occurs. 

The following tests by Cushing and Ruch (19), similar to a 
test of the same nature by Raubenheimer (37), who in turn drew 
his material from the “Claik Rating Scale of Offenses,” shows 
the trend toward using described situations with wh'ch younger 
boys would be more familiar. 

Directions 

Below are ten offenses committed by boys in a certain reform 
school. Read them through carefully and find the worst offense 
Mark it i Read them through again and find the next worst 
offense. Mark it 2 Mark the next worst 3, and so on down to 
the least serious, which will be 10 Every rank from i to 10 must 
be used. No tie ranks are permitted. , 
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Begin here: 

Rank 

(a) Associated with hoboes — slept out in caves, tents, 

and boxes 

(b) Bullied the public school in Los Angeles, which 

he attended 

(c) When his fathei staited to take him to a detention 

home foi misconduct, he asked to leturn to the 
house to say good-by to his mother There he 
took a 22-riflc and shot his father, killing him in- 
stantly — 

(d) Ran away fiom home and obtained employment, 

securing his room and board with anothei fam- 
ily. — 

(e) Forged a check for $10 

(r) Played hookey to attend a circus. 

(g) Together with another boy was accused of mur- 
der They had been drinking, were given more 
liquor by two adults, and in a fight which ensued 

one of the men was killed 

(h) Stole scrap iron from railway cars on a siding 

and sold it to a junk man 

(1) Was sent to a parental school because he struck 
his teacher Was accused of drawing a knife on a 
boy tthose bicycle he had stolen, when the boy 
claimed it Arrested for carrying a blackj’ack and 

knife 

(j) Entered the house of the next-door neighbor and 

took $2 50 

Cady uses rating instead of ranking The relative advantages 
of latmg and lanking are discussed on page 76 His plan is to 
have each quality or offense or activity rated on a four-place 
scale according to the amount of blame one would assign to it. 
His directions and a sample of the test follow (10, p 135). 

Here is a list of eight words which describe people who are 
grouchy, extiavagant, nervous, forgetful, etc Divide such people 
into four classes by placing a cross opposite each word according 
to whether you think them: 

(1) very greatly to blame. 

(2) a good deal to blame. 

(3) a little to blame. 

(4) not at all to blame. 

Look at each word Think how you feel about such people 
^nd then place them m one of the four classes. 
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people who are 

( 1 ) 

very greatly 
to blame 

( 2 ) 

a good deal 
to blame 

(3) 

a little to 
blame 

(4) 

not at all to 
blame 

grouchy 

extravagant 

nervous 

forgetful .... 

dishonest 

careless 

impulsive . ... 

unambitious 


































Still another technique is the cross-out test of Pressey. In this 
test a list of words is given and the subject is instructed to cross 
out all the words which stand for things he thinks are bad. 

Illustration. 

Directions Read through the twenty-five lists of words given 
just below and cioss out everything that you think is wrong 
— everything that you think a person is to be blamed for. You 
may cross out as many or as few words as you like, in some 
lists you may not wish to cross out any words Just be sure that 
you cioss out everything you think is wrong 

1 begging smoking flirting spitting giggling 

2 fear angei suspicion laziness contempt 

3 dullness weakness ignorance meekness stinginess. 

The matter of scoring a test -which is in the form of ranking 
has always given some trouble Raubenheimer used as a scoring 
method for his test of offense rating “the sum of the square 
of the deviation m ranking for each item from the standard 
ranking developed by Clark in his ‘Offense Rating Scale.’ ” 
The advantage in squaring the differences is that it increases the 
reliability of the scores. 

Naturally the lower the score in this test, the closer the agree- 
ment with the standard 

Cady brings out the point that in ratings standards vary con- 
siderably, so that when one tries to assign a score to an indi- 
vidual on the basis of his deviations fiom some norm or stand- 
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Table 49 



ScORIXC JUDGMENTS EXPRESSED 

AS Ranks 








Sta,idaTd 

Q puL^U. 


Deviation 

Item 

rank 

\ng the test 

Deviation 

squared 

a 

3 

4 

I 

I 

b 

8 

7 

I 

I 

c 

10 

8 

2 

4 

d 

2 

5 

3 

9 

e 

6 

6 

0 


f 

1 

3 

2 

4 

B 

9 

9 

0 


h 

4 

1 

3 

9 

i 

7 

10 

3 

9 

j 

5 

a 

3 

9 


Sum 46 


ard, his variatioiis may be large not because he judges care- 
lessly or inaccurately but because he holds radically higher or 
lower standards. What one wants is not the average amount of 
deviation of the ratings from some norm, but the average amount 
of the deviauons from their own mean. Kelley* gives for the 
computation of the average deviation a short formula which is 
con\enient to use in this problem. The formula is 

Av. Dev. = FM - 1 x) 

where N = number of cases (things rated) 

F = number of cases lying below the mean 
hi = mean 

F 

Z = sum of the ratings below the mean 

* Kelley, T. L., Statistual Method (The MaanlUan Compaiiy, 1923). 
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True 


values as 


Deviations 

n by compe- 

Ratings assigned 

fiom the 

]udges 

in test 

hue valves 

6S 

6.5 

0 

2 6 

30 

+ -4 

7-9 

7.0 

— 9 

3-2 

4-5 

+ 13 

0.8 

2.0 

+12 

4-S 

4.0 

- -S 

I. 2 S 

1-7 

+ -45 

1. 1 

i-S 

+ 4 

8 0 

6.S 

-i-S 

2.2 

2.8 

+ .6 


Sum of absolute 

7-2S 


values of de- 
viations 

= .7 

10 ' 

+4-3 S 
-2.9 

Sum of algebraic values 

14s _ , 

of deviations 

10 


= .725 av. error 


av. 

am’t 

by 

which 

rat- 

ings 

were 

too 

high 


Calling this value, -I-.145, the mean, 
there are 4 scores (o, — 9, — .5 and 
— 1.5) lying below this mean. 


N= 10 
F= 4 
M = -f-,i4S 


F 

2 = 0— .<) -.5 -1.5 = - 2.9 

I 


Av. Dev. = ^^F M — J 

= ^[4X.i4S-(-2 
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=^(580+^9) 

_ 3480 

S 

Av Dev = .696 

If the woik IS carefully planned, this computation may be 
leadily run off, and the “score” of each individual may thereby 
be obtained with his tendency to overrate or underrate elimi- 
nated 

Reliability 

Reliabilities in this testing have not been very frequently re- 
ported Chassell and Chassell (ii) give the reliability of their 
Test of Ability to Weigh Foreseen Consequences as 91 when 
obtained for fifty-nine junior high school pupils Rauben- 
heimer (37) gives a reliability coefficient of 78 for his offense 
rating test m a thirteen-year-old group The best reliabilities 
are those reported by Watson (47) and the C E I (29). 

Watson’s reliability coefficients are given in the following table 


Table jo 

Rfllibility Coefficients of Test of Conduct Information and Opinion 
(from Watson, 47, p 36) 


Sttbject and type of test 
True-false statements on religious ideas 
“Duties” test — ^multiple-choice 
Yes, doubtful, no test on practices 
True-false test on leadership, obedience, 
patriotism, etc 

Ranking ways of acting in groups of 
three 

Choose best and worst of five alterna- 
tives dealing with problems of boys 
Check Items m list which tend to per- 
vert one . . .. 

Choose best and worst among five alter- 
native ways of acting 
Checking -b ^ or — methods of show- 
ing Christian spirit m home duties 
Yes ' No test on best ways of getting 
on at school 


No of 

Reliability Self-correla- 
coejjicient tion for 

{half 100 similar 

elements 

against half) 

elements 

20 

4S 

80 

20 

48 

82 

133 

90 

88 

10 

90 

99 

IS 

8S 

•97 

8 

70 

96 

17 

79 

9S 

8 

3S 

.87 

II 

.21 

70 

10 

,08 

46' 
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Hartshornc and May give the following reliabilities for their 
tests 


Table 51 


Reliability Coefticients — ^Moral Knowledge Tests 


Opposites 

Coeffictent of 
reliability 
828 

Cause-efFcct 

778 

Duties . . . . 

832 

Comprehensions 

. 805 

Protoc.itions 

733 

Recognitions 

798 

Principles 

688 

Applications of principles 

8io 


Schwesinger gives the exceedingly high reliability of 98 for 
the Ethical-Social Vocabulary Test (40). 

In general these are satisfactory reliabilities and are only 
slightly less than one expects fiom tests of the same type in school 
subjects The leliabilities are fully as high as are obtained else- 
wheie with questions that are matteis of fact or general agiee- 
ment as in the vocabulary test, but they are a bit lower where 
codes and standards vaiy, as is also true of tests of English 
usage Indeed one might well wonder that the tests show as high 
consistency as they do in view of the fact that ethical standards 
depend so laigely on a variable point of view 


Validity 

The question of exactly what significance the various tests of 
knowledge, judgment, and disci imination in the field of conduct 
have IS of gieat impoitance Undeniably these tests have been 
constructed and used m the belief that they have some lelation- 
ship to actual conduct. There is stiong popular conviction that 
knowledge and reasoning exercise potent control over conduct If 
this is true, these tests are of supreme importance, and even 
though it IS true that language tests do not have direct relation- 
ship to specific modes of behavior, they may represent whatever 
integrating force there is for behavior as a whole. In other words, 
even though language tests may not correlate with specific modes 
of ‘conduct, they may be very important as indices of general char- 
acter. What are the facts? 
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Intercorrclations of moral knowledge tests. By far the most 
significant nork in the validation of conduct knowledge tests is 
that of Hartshorne and May They give mtercorrelations for their 
tests as follows. 

Table 52 

Intehcorrelations of C E I Moral Knowledge Tests 
(from Hartshorne and May, 29, p 18) 




2 

3 

12 

4 

5 

6 

7 

9 

10 

11 

X 

1 

Opposites 

X 

X 

748 

750 

X 

362 

197 

X 

S72 

X 


2 

Similarities 


X 

612 

X 

X 

236 

X 

472 

X 

X 

, , 

3 

Word conse- 













qucnccs 

. 

. 

66 s 

X 

X 

137 

X 

440 

X 

X 

. 

12 

Vocabulary. 

• 



4S8 

383 

381 

276 

389 

330 

X 

720 

4 

Causes . 



• . . 


350 

000 

237 

Soo 

SS5 

326 

174 

5 

Duties . 

. 

. • 

• • • 

• • 

• • 

228 

331 

600 

S 7 S 

030 

302 

6 

Comprehen- 













sions 

• • 

« • 

« • 

• • 


• • • 

000 

400 

430 

363 

443 

7 Provocations 

« « 

• • 


• • 

• • » 

• 

, , 

172 

248 

463 

396 

9 

Recognitions 



• « 


• • 

• • • 


• 

276 

258 

478 

10 

Principles . 

• , 

• • 


• • 

• • • 


* , 


, 

164 

438 

11 

Applications 


. 

• • • 

• • • 

• • • 

• • 

• • 


, , 

, 

274 

X 

Good manners 

, • 

, 

• • • 

• • 

• • • 



» » • 

• • • 

, 

* • • 


One should first note that all the correlations are substantially 
positive, indicating that there may be some common factor run- 
ning through all of the tests. The logical grouping of tests which 
was made on page 277 finds no counterpart in the correlations 
Tests 5, 6, and 7, which test more or less directly knowledge of 
desirable conduct, correlate .228, 331, and 000 respectively. 
Tests of knowledge of moral principles and their applications 
(9, 10, and ii) also have low correlations ( 276, .258, and 164) 
On the other hand the vocabulary tests have substantial inter- 
correlations. The correlations perhaps are best explained by the 
mechanical nature of the tests and the psychological kinship of 
the functions rather than by any logical relationship which might 
be imagined The reliability of the tests may be one factor deter- 
mining the size of the mtercorrelations. The matter of standards 
of the answers may be another factor Tests in which there is 
common agreement as to the rightness or wrongness of an answer 
should correlate better than tests in which the standards are more 
subjective. This may partly explain the higher correlation of the 
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vocabulary tests Again, tests of knowledge, even though they may 
meiely minor cuiient codes of behavior (as the tests of duties 
and principles, 575) and the tests of judgment and application 
(such as the tests called provocations and applications, 463) may 
correlate better than tests in one of these groups with tests m 
another (principles and applications, 164) If these aie the fac- 
tors which determine the correlations, it is evident that they are 
not very helpful in deciding upon the possible dynamic re- 
lations between the vaiious phases of knowledge and reasoning 
on conduct But they do indicate that the different types of 
activity are loosely strung together, making it possible for a per- 
son to possess principles of conduct even though he is not in a 
position to estimate piobablc outcomes in action or to apply the 
principles 

Correlations of moral knowledge tests with a composite. Hart- 
shorne and May also found the correlations of their Moral Knowl- 
edge Tests with the weighted composite of the first seven tests 
Assuming that the reliability of the composite is 90, these correla- 
tions, corrected for attenuation, are as follows: 

Table J3 

Correlation or Judgment and Reasoning Tests against a Composite op 
Vocabulary and Knowledge Tests 

(from Hartshorne and May, 29, p 21) 


Causes . . . 521 

Duties . 544 

Comprehensions 372 

Provocations . . . 421 

Recognitions . 581 

Principles 636 

Applications 418 


Correlations of moral knowledge tests with intelligence. Healy 
and Fernald (30) concluded from their use of a moral j’ucfgments 
test that It was useful for estimating a subject’s powers of intellec- 
tual comprehension. Chassell and Chassell found their test to cor- 
relate with IQ 43 and MA 49. Raubenheimer’s test of offense 
rating correlated with the National Intelligence Test .63. Hart- 
shorne and May find that their moral knowledge tests correlate 
with intelligence, as determined on intelligence test material fur- 
nished by Thorndike, as follows: 
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Table S 4 

CoRREL-tTION OF MoRtL KNOWLEDGE 

AND Intelligence 

(from IlaiLshoinc and May, 29, p 20) 

Opposites . . . ... 

77S 

Similarities 

. 664 

Word consequences 

S19 

Cause and effect 

647 

Duties 

402 

Comprehensions 

371 

Provocations 

US 

Recognitions 

498 

Principles 

. +++ 

Applications . 

562 

Vocabulary . 

. 8S2 

Composite of seven tests 

.. 686 


These are substantially high and indicate that general intelli- 
gence IS a large factor in determining a score on a test of moral 
knowledge This is further emphasized by noting that when in- 
telligence IS partialed out of the correlations between a moral 
knowledge test and the composite of moral knowledge, the correla- 
tion drops down close to zero “Duties,” “piovocations,” and 
“piinciples” seem to test something beside intelligence, however 
Correlations of moral knowledge and age. Hartshorne and 
May found the following correlations between age and certain 
tests. 

Table 55 

CoRRELilTION BETWEEN MoRAL KNOWLEDGE TeSTS AND AgE 


(from Hartshorne and Maj^, zg, p 22) 


Opposites 

— 094 

Similarities 

— 068 

\\ ord consequences 

— 204 

Causes 

473 

Duties . . 

138 


Vocabulaiy . 


Comprehensions . 416 

Provocations . — 097 

Recognitions 172 

Principles . 026 

Applications . 183 

. — 091 


F rom these figures it would seem that, in general, moral knowl- 
edge shows little progression with age for children in grades five 
to eight, much less change, in fact, than is usually found with 
school subjects It would seem as though moral knowledge be- 
comes fixed rather early in life or that it is relatively uniform 
thioughout a group such as a class or school Slavcns and Brogan 
(43) in studying judg'fnents of the seriousness of acts, found that 
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high school and college students hold approximately the same 
moral standards, from ^\hich finding he reaches the conclusion that 
these standards become fixed iclatively caily 

Correlation of moral knowledge and conduct. The testimony 
here is remarkably unanimous In 1911 Healy and Bionnei (30) 
concluded that moral judgment has little 1 elation to a subject's 
moral nature Haines (24) found that a moial judgment test did 
not difleientiate between noimal and delinquent girls Lowe and 
Shimberg (35) discovered that the Binct “Fables Test” does not 
diffcientiate delinquents fiom non-dclinquents Schwesinger (39) 
reported that hei test of slang does not separate delinquents from 
non-delinquents Raubcnhcimer (37) found that the test of offense 
rating differentiated between delinquent and non-delinquent 
groups to a degree represented by coriclations of 31, 26, 24, 
and — II Cushing and Ruch (19) found the biserial 1 between 
delinquent and non-dclinquent groups and offense rating to be 
27 Weber, (48) reported a high correlation between the order 
of Brogan’s items lated according to seriousness of offense for de- 
linquent and normal women Branham (2) believes from his study 
of criminals that the defective delinquent can tell more readily 
whether an act is right or wrong than he can tell the enormity of 
the offense Chassell and Chassell (ii) found a correlation of 17 
between their test of ability to judge consequences and ratings on 
conduct 

Hightow^er, from his study of the relation of Biblical informa- 
tion to character and conduct, concludes that a knowledge of the 
Bible has no relationship to any phase of the conduct which he 
tested His study indicates that mere knowledge is not of itself 
sufficient to insure proper character giowth 

Hartshorne and May (27), in the most analytical work per- 
formed in this field, present coirelations found in Table 56 on the 
following page between tests of moial knowledge and cheating. 

These correlations are very low. The conclusion is that there 
is a very low positive correlation between moral knowledge and 
desirable conduct When groups socially selected on the basis of 
conduct are tested with moral knowledge, this relationship is 
also so low as to be distinguishable only with reliable tests These 
conclusions have far-reaching implications They end once for all 
speculation as to a general relationship between conduct and 
knowledge. Hartshorne and May even go further and say that it 
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Table 56 

Relation of Cheating to Moral Knowledge 
(from Hartshorne and May, 29, p 25) 



Home 

School 

Tests 

cheating 

cheating 

Cause and effect 

-fo3i 

— OS 4 

Duties 

-178 

— 296 

Comprehensions 

— 018 

— 301 

Provocations 

— 129 

— 241 

Recognitions 

— 091 

— 181 

Principles 

— 088 

— 247 

Applications 

— 066 

— 402 

Sum of 1-7 

. . — 121 

-38s 


IS not possible to pick out items of moral knowledge which con- 
sistently predict or correlate with conduct 

So much for the bare facts of relationship Various investigators 
have followed the matter further and have discovered other facts 
and relationships that help uncover the basis for conduct knowl- 
edge and judgment. 

Sex differences. The sexes hold the same standards with respect 
to conduct to a remarkable degree Brogan (4) found a high de- 
giee of agreement in the rating of offenses according to serious- 
ness, except as regards smoking and idleness. Probably the differ- 
ences between the sexes in the matter of smoking has broken 
down to some extent since Brogan did his work, and in the mat- 
ter of idleness the difference w'lll also grow less as women enter 
industry, compete with men, and occupy less and less the position 
of an idle class But the conclusions of Snyder and Dunlap (42) 
and of Tanaka (43) are that women are more severe than men 
in expiessing their judgments on conduct issues The bad they 
condemn more harshly and the good they praise more highly 

Ethical standards. The matter of standards is very troublesome. 
In some fields such as health, standards are fairly definite, but 
for a moral knowledge test there are no answers which may be 
commonly accepted as standard, as there are for a test of knowl- 
edge in history or science. Who is to be the arbiter? Hartshorne 
and May found that the j'udgment of a graduate class in character 
measurement often resembled the responses of a class of children 
more than it did tiaditionally great historical moral ideals If to 
test means merely “to suivey,” then the tests cannot be scored 
in the usual sense. Ev^ry answer must be taken at its face value 
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But if we giant lliat there are standards and that some kinds of 
conduct aie better or more important than otheis, then the test 
may be scoied Hartshorne and hlay wondcied nhether childien 
may not have their own conduct codes as contrasted with the 
adult or ideal standard, and whether this child code may not have 
some positive relation to then conduct They made a key for all 
their tests based on children’s typical or modal i espouses The 
correlations of four of the tests w'lth deception as measured, by 
performance tests W'eie +015, +071, —058, and 115 These 
are even closer to zero than when the ideal key was used, indi- 
cating that the matter of standards is not the factor which elim- 
inates the relation with conduct. In the social-moral area, the best 
available expert judgment must be used to determine standards 
The personal or abstract answer. That one’s abstract judgment 
of what should be done in a moral situation and his statement of 
what he w'ould do himself may not agree has ahvays been recog- 
nized On the other hand, since the two responses are both verbal, 
the relation should be consideiably closer than the relation be- 
tween knowledge and conduct itself Hartshorne and May found 
a correlation of .77 between two tests of duties, one in which the 
questions are stated. 

It IS my duty 

To help a slow or dull child with his lessons. . Yes ? No 
To read the newspapers e\ery day Yes ? No 

and the other in which the questions are stated 

Did you ever help a slow or dull child with his lessons^ Yes No 
Do you read the newspapers every day^ Yes No 

Cady raises a similar question when he inquires whether the 
response will be the same when the pupil is thinking of the situa- 
tion with respect to a large group and when he is thinking of it 
with respect to a small group. This leads to the larger question 
of the relation of the response of an individual to the group of 
which he is a part and with which he has been associated 

Moral knowledge and the conventional code. An old dispute 
in ethics raged around the issue whether one determines his 
course of action by a consideration of the consequences of an act 
or by the application of general principles The answer that mod- 
ern experimentation gives to this question Ik that neither is a cor- 
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rect description of what happens Habit is the main control of 
conduct But there can be no denying that the question is a real 
one, for we are constantly making decisions, some of ■nhich issue 
into conduct Our tests make it clear that actual foresight of con- 
sequences IS a raic method of solution, and the consensus of opin- 
ion of those riho have studied tests and then results is that chil- 
dren’s knowledge and judgment of conduct reflect the conventional 
code or codes Snyder and Dunlap (44), who tested college stu- 
dents on their judgment reactions to a large number of personal 
experiences, conclude that there is sometimes a conflict between 
the traditional and the reflective as evidenced by the high variabil- 
ity of response, but that the traditional dominates over the re- 
flective except in cases where the welfare of others is concerned 
Interesting evidence in this connection has been obtained by 
Hartshorne and May (29), who gave their moral knowledge tests 
to fathers, mothers, teachers, Sunday-school teachers, and play- 
mates of a certain group and found which groups had a code most 
like that of the children being tested The following correlations 
were reported: 

Table 57 

CoRRELATIOSlS BETWEEN hlOBAL KNOWLEDGE OF CHILDREN AND OF 

Other Groups 

(from Hartshorne and May, 29, p 4.3) 


Childien and father . ... 

40 

Children and mother 

49 

Children and both parents 

S+S 

Childien and teaeher 

028 

Children and Sunday-school teacher 

. 002 

Children and club leaders ' 

137 

Children and playmates 

3 S 3 


These facts would indicate that children obtain their codes 
largely at home and from their playmates It might be said that 
the influence is proportional to the amount of time spent together 
and the amount of sharing of experiences. These same investi- 
gators found that the codes of children vary according to the 
group in which they are tested. 

Moral knowledge and group conduct. Hartshorne and May 
have further demonstrated that the relationship between knowl- 
edge and conduct is strikingly higher when groups are considered 
as a whole rather than- as individuals. ' - 
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Table 58 

Correlations detmcen Moral Knowledgl and Condlct tor Indimduals 
AND Groats — Aalrvce or Eight I’lsis 

(fiom Ilartihornc and ^lay 29, p 4(.) 


Bchav lor 

C 

(undesirable) 

Initiz-idual 
— 25 

Croup 

— 44 

Bchav lor 

A 

(undesirable) 

— 13 

— IS 

Behai lor 

II 

(desirable) 

. + 23 

+ 53 


These facts help to explain the oiigin of oui knowledge of con- 
duct and our conduct codes Hartshoine and May find m them 
indications that standaids of conduct operate thiough group ac- 
tivity as opposed to individual activity 

‘‘Since the gioup I’s aie laigei than the individual I’s, they can- 
not be accounted foi by a causal relation between moial knowl- 
edge and conduct, since this relation could operate only thiough 
the minds of the individuals concerned Hence the superiority of 
the group r’s must' be due to the reaction of indiv'iduals to some 
influence which tends toward higher code and moie social conduct 
(and vice versa) without these being integrated in the minds of 
the individuals Such a common influence might be excited by 
the gioup as a whole thiough a growing tradition or by the 
teacher 01 by school system, or by all three No mattei how much 
It affects either conduct or code foi the better, if the correlations 
indicate the absence of individual integration, this improvement 
can haidly be regarded as growth in character” (29, p 68 ) 

Acceptance of this last statement vv'ould seem to depend on how 
character is defined Its authois hav'e not been uninfluenced by a 
wish as to w'hat chaiacter ought to be, as contrasted with the atti- 
tude of taking things as they are found 

Miscellaneous Factors Associated with Moral 
Knowledge 

Snyder and Dunlap (44) in their study of the moral judgments 
of college students uncovered other factors which determine moral 
judgment The magnitude of the act (meaning the seriousness of 
the consequences, the number of people involved, the amount of 
damage done) is one factor on which judgment is based. Brogan 
discovered that there is a negative correlation between the fre- 
quency of an act and judgment as to its seriousness Acts that are 
frequent will not be judged to be serious. 
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Motive back of an act is also a factor used in judging 
the seiiousness of an act Usually the motive must be guessed 
at from the description of the act printed on a test blank. But 
the motive is equally hidden in actual life situations It is the 
u riter’s opinion that the educational outcome is saner and 
leads to better integration when greater attention is paid to the 
boaal outcomes of behavior than to the motives leading to the 
behavior 

Other factors which influence judgment are the condition of the 
peison affected by the act, and the degree of personal sacrifice in- 
volved in performing the act Social virtues are moie highly es- 
teemed than those involving mere individual well-being Tanaka 
(45) found that Japanese children agreed tolerably well in evaluat- 
ing issues concerning state matters and foreign countries, but that 
there was considerable difference of opinion in judging personal 
standards of conduct. 

Conclusion 

It IS possible to measure knowledge and judgment with refer- 
ence to conduct thiough the application of seveial useful tests 
which have been constructed for measuring health. Biblical knowl- 
edge, ethical knowledge, etc These tests have very satisfactory 
icliability, comparing favorably with tests in the school subjects, 
similarly constructed They correlate somewhat with each other 
and substantially with intelligence mgeneial The correlations with 
conduct are very low, so that with the less perfect tests they seem 
to fail to differentiate between noimal and delinquent individuals 
Conclusions from the research work done with these tests seem to 
indicate that answers reflect the code of the group in which the 
individual happens to be rather than any reasoned solution to the 
pioblem situation presented for judgment These codes seem to be 
group affairs, and there is a distinct coi relation between conduct 
and knowledge when groups as a whole are considered The low 
correlations of knowledge and conduct for individuals indicate how 
distinct the two forms of activity are. On the other hand, when 
these correlations are compared with the correlations between dif- 
ferent forms of conduct, there is ground for the suggestion that 
pel haps knowledge and judgment of conduct constitute after all 
the one force, however ineffective, that works toward integrating 
conduct. . ^ 
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Chapter IX 

PERFORMANCE TESTS 


T he most obvious method of measuring conduct is by means 
of the direct recoid of actual conduct itself This method 
has had an extensive try-out in the psychological labora- 
tory, and more recently theie have been excellent field studies 
using these difficult techniques The measurement of conduct pre- 
sents certain difficulties that make it costly and exacting In the 
first place conduct does not usually leave behind a record which 
can be studied at leisure In the testing of ability, for instance, 
we can give an individual certain tasks to perform such that his 
performance of these tasks will itself provide a record which can 
be studied, evaluated, and utilized as a measuring stick But to 
obtain a record of conduct necessitates considerable maneuvering 
and arrangement of the situation Hartshorne and May, for in- 
stance, have given children school tests in aiithmetic, not for the 
purpose of discovering their abilities in aiithmetic but for deter- 
mining whether the children would cheat when given the oppor- 
tunity Mailer* also used tests in arithmetic in older to determine 
whether children prefer to work foi themselves or for the group. 

Observation may be used in the direct measurement of conduct, 
and certain techniques have been successfully worked out recently 
using observation methods Ordinarily, however, this requires con- 
siderable skill and practice, and the method can never be as objec- 
tive as when some lecord is left behind to be studied 

A second difficulty which the direct measurement of conduct 
presents is that of keeping the pupil unaware that he is being 
measured In order to disarm the person being measured he must 
be thrown off the scent, as it were, by telling him to do one thing 
while at the same time giving him the opportunity to do some- 
thing else When one speaks of the measurement of conduct, he 
means measuring the number of times certain responses follow 

■■Mailer, J B, Cooperation and Competition, Teachers College Contribution 
to Education, No 384 (19^9). 
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ceitain antecedent situations For instance, to measure cheating 
Haitshorne and hlay (10) counted both the number of times a 
child would cheat and the amount of cheating he did, when given 
the opportunity. But if they had told their subjects that they 
weie watching to see if any one cheated, that would have been 
a new element in the situation, the children would have been on 
their guard, and the cheating presumably would have been much 
less The direct measurement of conduct cannot issue from a 
frontal attack, as the measurement of ability can. It must catch 
the subject unawares 

May and Hartshorne have constructed a complicated schema 
to describe this distinction between the measurement of conduct 
and the measurement of ability They have classified situations 
into (a) natural, such as would ordinaiily be met in the course 
of life’s experiences whether the psj'chologist happened along or 
not (these natuial situations are further divided into uncontrolled 
and conti oiled, the lattei being planted so as to occur when and 
where the experimenter may obseive the results) and (b) experi- 
mental, which are so artificial and unusual as never to occur ex- 
cept when set up by an investigator They have also classified 
responses into (a) natural (either controlled or uncontrolled) 
and (b) experimental These situations and responses are then 
paiied in every possible grouping as follows. 

Natural uncontrolled situation — natural undirected response 
Natural uncontrolled situation — natural directed response 
Natural unconti oiled situation — experimentally directed response 

Natural controlled situation — natural undirected response 
Natural controlled situation — natural directed response 
Natural controlled situation — experimentally directed response 

Experimentally controlled situation — natural undirected response 
E.xpeiimentally controlled situation — natural directed response 
Experimentally controlled situation — experimentally directed re- 
sponse 

This schema is helpful in emphasizing the distinction between a 
test and the measurement of uncontrolled conduct. There seems to 
be some confusion, however, in trying to make a distinction be- 
tween the situation and controlling the response. Certainly any- 
thing done to control the response must be part of the situation. 

“■May M A, and Hartshoinc, II, ‘Objective Methods of Measuring Char- 
acter,” Pedagogical Seminary, 32 45-67 (1925). 
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The dislaiictioii these authors are trying to make is between 
the naturalness of the situation (whether it occurs in the ordinary 
expel lence of the individual) and the directions given to the indi- 
vidual being measured But these aie both part of the situation 
By the directions we determine whether we are testing ability or 
measuiing conduct In considering the naturalness of the situation 
ne laise the question of the transfer value of the measurements 

This bungs us to a third and fundamental difficulty in using 
actual peifoimance as a measuie of conduct If transfer is geneial, 
and if theie is high correlation between various exhibitions of the 
same trait, then the laboiatory situation is as valuable as the 
uncontrolled natural situation for the measurement of conduct. 

This point of view is illustrated by the following anecdote: 

“A gentleman advertised for a boy, and nearly fifty applicants 
presented themselves Out of that number he selected one and dis- 
missed the rest ‘I should like to know,’ said a friend, ‘on what 
ground )'ou selected that boy, who had not a cmgle recommenda- 
tion'’ 'You are mistaken,’ said the gentleman ‘He has a great 
manv He wiped his feet when he came in, and closed the door 
after him, showing that he was careful He gave his seat instantly 
to that lame old man, showing that he was thoughtful He took 
off his cap when he came in, and answeied mv questions promptly, 
showing that he was gentlemanly He picked up a book w'hich I 
had purposely placed on the floor, and placed it on the table, and 
he waited patiently for his turn instead of pushing and ciowding, 
showing that he was orderlv and honorable When I talked to him, 
I noticed that his clothes were brushed, his hair in order When he 
wrote his name, I noticed that his finger-nails wxie clean Don't 
you call these things letters of recommendation ” 

If, on the other hand, the transfer is small so that the vaiious 
performances going under the same trait name correlate only 
slightly with each other, the response to the laboiatory situation 
cannot be used as a measure of the response to the natural situ- 
ation If this latter be the state of affairs (and we shall find that 
It is), then we must carry out our measurements in the exact 
situations which we think important. For then the response has 
no reference beyond the narrow situation in which it arose. 

Let us not anticipate our conclusion here. Interest in a scien- 
tific attack on the problems of conduct is widespread, and the 
pioblcm IS a practical one We aic interested in selecting persons 
who will be successfub in business and industry, or in advising 
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individuals so that they may make the most happy occupational 
adjustment We are interested in measuring a person’s adapta- 
tion to his surioundmgs or his ability to adapt himself so that if 
lie IS mentally unhealthy he may be given piopci care and tieat- 
ment We are interested m discotermg those uho are apt to 
become enemies of society so that they may be reeducated or 
segregated befoie the trouble arises In other words we are not 
so much interested in whether a child eats candy between meals 
w'hen It IS given to him, or cheats in a test in arithmetic when the 
teacher is out of the room and the lesults will affect his chances 
for piomotion, or fails to continue doing his homew'ork when a 
fire engine clangs past the house, as in whethei he has healthy 
habits, or is honest, or is studious If these specific responses to 
these specific stimuli aie symptomatic to a slight degree only, w'e 
must acknowledge the facts and base out conclusions and practice 
accoidingly. 

In the following pages the outstanding attempts to measure 
conduct directly will be dcsciibcd The techniques themselves will 
be desciibed, and the method of scoring illustrated Reliability 
figures and intei correlations w'lll be given when available Finally, 
generalizations will be made as to the value and significance of 
the direct measurement of conduct 

Before going into detail on the various tests, let us first stop 
for a brief summaiy of the historical setting of the work on these 
tests Although the earliest w'ork on the dynamic factors of the 
personality, earned out stiicdy in the psychological laboratory, 
had only an abstract, theoretical interest, it set the scene for the 
later piactical applications The applied work on these perfor- 
mance tests of conduct has been earned out within the following 
aicas of study 

1 The characteiistics of delinquents 

2 The problems of cheating in school tests and examinations 

3 The value of certain educational programs (such as the Boy 

Scout movement) for the development of character. 

4. The problems of persistence, caution, etc. 

5 The character traits of gifted children 
6. The development of young children 

These various fields of investigation have been more or less dis- 
tinct, and only more recently have they tended to coalesce and tG 
borrow techniques and findings from one pother. 
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One outstanding piece of woik was that of Voelker (17) His 
investigations were an outgrowth of the Indiana Suivey of Reli- 
gious Education (i) undertaken in 1920 and 1921 The collapse 
of the ill-fated Intcichuich World ]Mo\cmcnt, nhich was born 
on the wave of enthusiasm following the World War, necessitated 
the curtailing of the original plans foi the survey, and the lesults 
weie salvaged by the Committee on Social and Religious Surveys 
of New Yoik Citv The committee in charge of the curriculum 
could not agree upon the advisability of using a set of “moral 
conduct tests ’ in the survey, and since both agreement and funds 
weie lacking, the noik was dropped Interest in tests of this type, 
however, was maintained, and P E Voelker made them the 
subject of a doctor’s dissertation at Columbia University In this 
pioneering and notable noik he had the assistance of the Boy 
Scout organization, the encouragement of W S Athearn and others 
of the Indiana Suivey committee, the direct help and guidance of 
E L Thorndike, and statistical advice fiom T L Kelley. 

Kelley earned the most promising ideas of this work with him 
to Stanford, and a year or so latei we find V M Cady (4), work- 
ing under the guidance of Teiman and Kelley, studying the prob- 
lem of incorrigibility, and using some of the tests originated by 
Voelker Later Raubcnheimer (15) at Stanford worked on a bat- 
tery of tests which should piedict tendencies toward delinquency 

Interest in the development and reorganization of religious edu- 
cation continued, however, and converging demands from various 
religious organizations finally led the Institute of Social and Re- 
ligious Research to supply funds for the establishment of the 
Character Education Inquiry in 1923 Hugh Hartshorne and 
IMaik A May were selected as investigators, and the wisdom of 
the choice has been demonstiated by subsequent results Orig- 
inally planned to be a thiee-year study, starting in September, 
1924, It was eventually e.xtended another Uvo years, till 1929 

The first year’s program consisted of a study of tests of honesty 
and trustworthiness , extending and refining techniques originated 
by Voelker. These tests were followed by other groups of per- 
formance tests to measure helpfulness, inhibition, and persistence. 
At the same time tests of moral knowledge and attitude were 
studied Finally the wdiole battery of tests which had been de- 
veloped by the Character Education Inquiry was given to over 
850 children, and a si|udy of interrelationships and integration 
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was undertaken The inclusiveness and high standards set by the 
two investigators has established the study of peiformance tests 
in a position unrivaled in other fields of educational research. 


Tests of Honesty 

These tests have been studied so extensively and repoited so 
completely and ably 111 Hartshorne and May’s Studies in Deceit 
and Studies in Service and Self-Control that they will receive 
somewhat summary treatment here. The reader is referred to 
the books themselves for a fuller explanation. In the first place, 
deceit IS divided up into 

1 Cheating 

2 Stealing 

3 Lying 

Test of cheating, i Tests of copying Since much of the cheat- 
ing in school IS supposed to be copying from one’s neighbor, it is 
natural that efforts should be made to discover the amount of this 
practice Gundlach (9) tried the technique of seating students in 
pairs and giving some pairs identical test papers and other pairs 
dissimilar papers. When each member of a pair had the same 
paper, the number of errors was reduced (10 per cent) and the 
number of identical errors increased (18 to 20 per cent). But one 
does not know from this procedure how many or which indi- 
viduals actually cheated Hartshorne and May used test papers 
in which small changes were made m the questions which would 
be unnoticed except under close scrutiny, and tried to measure 
the amount of copying by noting the agreement between papers 
of pupils who sat side by side However, they were not able by 
this method to differentiate accidental agreement from agreement 
due to copying This might be done by noting the amount of 
agreement that exceeds what would be expected by chance, but 
such a method requires specially prepared test material 

2 The duplicating technique The first form of this technique 
was devised by Voelker and called by him “Completion Test,” 
although the name “Paraffin Paper Test” is more suggestive. 
Voelker’s (18) description of the test is as follows' 

This test IS given on a prepared four-page folder, the sentences 
with the blanks being on page one, and the completed sentences 
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being on page four Page two is entirely blank Page three has 
a coating ol paiafhne 

“Diiections The method of giving this test is as follows A 
folder IS placed befoie each subject, face side up The subject 
is told that the completed sentences aie on page 4 and for that 
leason he is not to look on page 4 The examiner remains in the 
room to see that those instiuctions aie obeyed. 

“When the time is up, the subject is requested to open the 
foldei and to place it on the desk before him in such manner 
that he can see pages i and 4 at the same time This procedure 
will lessen the chances of the subject’s discovery of the paraffine 
on the inside of the folder, which contains a record of his effort 
to complete the test The subject is requested to scoie his own 
papci, using page 4 as his model During this part of the work 
the examinci absents himself to give the subject opportunity to 
cheat if he desiies to do so 

“A compaiison of the record made on the waxed surface, with 
the lecoid as handed in on page i will reveal whether the sub- 
ject attempted to cheat 

“Scoiing Score the subject 10 if he made no attempt to cheat 
Score him o if he cheated.” 

Cady, who used this test, had the pupils score themselves both 
on the sum of eriors made and for the number of spaces not filled, 
the latter being an additional stimulus to leave as few spaces un- 
filled as possible Cady used the method of scoring for the pres- 
ence or absence of cheating as described by Voelker (called fact 
scoie), and also used a score which represented the number of 
changes made or the amount of cheating (called amount score) 
“Graded scores do not correlate any more highly with the criterion 
than do the two interval scores.” 

Cady found the reliability of the paraffin-paper completion 
test of fifteen items to be 578, using only “fact” scores Scoring 
for “amount” showed about the same reliability This test corre- 
lated only 188 with the cnteiion of incorrigibility 

Hartshorne and May used a simpler “duplicating technique ” 
After the questions of any kind of obj’ectlve or short-answer test 
had been answered, the papers were collected and taken away, 
and a duplicate of each paper was made by clerks At a later time 
when the papers were returned to the pupils together with an 
answer key, the pupils were instructed to score their papers, the 
examiner piesenting all the while every opportunity to those who 
wished to change their answers to agree with the key The test 
papers were then compared with the original duplicates and 
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C\RDDOARD Test 

\V?it for tlic ■sigiul for each tiial Put the point of your pencil on ihe cioss 
at tlie foot of the oi.il J hen uhen the signal is pncn shut joui ejes und put 
the figure I in each circle, taking them in order hor the second trial, put the 
figure z in cath circle For the third tii.il, the figure 3, etc After each Inal 
put a check niaik in the scoie box undei the number of each circle vou suc- 
ceeded in striking, count the checks and enter the total m the column headed 
r at the right of the score box After the last trial add up Column T. This 
IS your score The maximum score is 50 



From Hartshome and May, Stmhes m Deceit (The Macmillan Company 
1928) By permission of the Macmillan Company, publishers 
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changes noted. This simple and effective device was expensive 
and time-consuming, ho\\e\cr, and has not been used exten- 
sively 

j The improbable achievement technique The earliest form of 
this technique was devised by Voelker, who called it the “Card- 
board Test” Both Cady and Hartshoine and May used the 
same device The figure on page 305 shows the foim used by 
the Charactei Education Inquiry The diiections indicate the 
method. 

All three experimenters tried out the possibility of completing 
the test honestly Voelker, using five circles, reports, “This feat is 
impossible of accomplishment as fai as it has been possible to 
determine The examiner himself, after long practice and fifty 
successive tests, was unable to score a single success No trust- 
worthy subject has been found who is able to accompli di this 
feat.” Cady says, “The chances of obtaining a peifect score in 
the ‘circles’ is very remote One would be required to hit each 
circle, large or small, in every one of the five trials ” Ten adults 
tried to do this, each making ten trials In no instance, despite the 
practice acquired, weie both circles 3 and 4 hit by the same person 
in his ten tiials Hartshornc and May verified this with school- 
children under conditions where there was no possibility of 
cheating 

Similarly Cady and also Hartshorne and May used “nested 
squares” and “mazes” as coordination tests In these tests the 
pupil was instructed to draw his pencil in the pathway betiveen 
squares or through mazes without touching the sides, tasks which 
are impossible with both eyes shut Cady was careful in his forms 
to have the first exercise so easy that theie was a possibility of 
performing it successfully “The situation is one in wdiich honesty 
on the part of the subject appears to him to be taken for granted 
Integrity of purpose is implied in the care with which the ex- 
periment IS carried out.” 

Voelker merely gave a score of 10 or o according to whether the 
subject gave evidence of having cheated or not Hartshorne and 
May worked out both “amount” of cheating and “fact” of cheat- 
ing scores for all their tests. In these particular tests the amount 
score was the number of corners correctly turned in the squares 
and mazes and the number of figures correctly placed in the 
circles. The fact scorfe (cheating) was 14 or more corners in the 
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squares, 13 or moic maiks m the circles, and 32 or more corners 
in the maze 

Cady (4) finds that his test of “ciicles” coiielates with his 
“maze” test .744, and this may be used as a measure of reliability. 
With forty-four cases the mtercorrelations are. 

Table 59 

Intercorrelatioms or Circles, Spaces, and Mazes Tests 
(From Cady, 4, p S 7 ) 


Circles and 

spaces 

592 

Circles and 

mazes 

513 

Spaces and 

mazes 

630 


Each of the three correlates with the criterion of incorrigibility as 
follows . 

Table < 5 o 

Correlations of Circles, Spaces, and Mazes Tests with Incorrigibility 
(from Cady, 4, pp S 4 . 57 ) 

ISO cases 44 cases 


Circles 318 526 

Spaces 297 324 

Mazes . . . 524 

Circles plus squares . . . 398 


Hartshorne and May (10 II, p 97) find the reliability for their 
squares, circles, and mazes tests based on mtercorrelations (pre- 
dicted by Spearman-Brown formula) to be .721, and based on 
retests to be 750 

Hartshorne and May also used the “improbable achievement” 
technique in the following forms, (a) puzzle peg, (b) the fifteen 
puzzle, and (c) weight discrimination test 

To illustrate the method, the last form only will be briefly 
described Seven small pill-boxes weie filled with cotton batting 
and buck-shot so that their weight m giams was (a) 3 6, (b) 3 7, 
(c) 3 8, (d) 3 9, (e) 4 o, (f) 4 I, (g) 4 2 The differences between 
adjoining boxes were so slight that they could not be detected, 
making it impossible to arrange them in order of weight except 
by chance Numbers from one to seven were written on the 
bottom of the boxes. 

“The instructions were to turn the numbers down and arrange 
the boxes in the order of their weight. Aftfer the first trial the 
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pupils were told to look at the numbers on the bottom and copy 
these numbers off on the score sheet to show how they had been 
arranged They were then told that the coirect arrangement was 
the serial order i, 2, 3, 4, S, 6, 7, and were asked to turn the 
numbers down again and not look at them during the second 
trial.” (10, p. 59 ) "' 

Amount of cheating was determined by counting the number 
of boxes in correct order in any one trial A table was drawn up 
for determining the amount of cheating when combining the scores 
on the two trials which were given The fact of cheating was 
credited if a score of seven (all seven boxes m correct position) 
was made in the second trial; or a score of five on the second trial, 
piovided the scoie was five or seven on the first trial. 

The reliability of this test as determined by mtercorrelations 
IS 750 

4 Double testing technique The overstatement test is one of 
the most widely used tests employing the double testing technique. 
It was first devised by Voelker, developed by Cady and Rauben- 
heimer and Ruch, and later studied by Cushing and Ruch (7), 
and Woodrow and Bemmels (20). 

Ratibenheimer used it as follows (15, p. 39). 

First there was a practice form, introduced m order to motivate 
the desire to make overstatements (Voelker even offered a prize 
of a box of candy to the boy making the highest score.) 

The directions read. 

How many of the 10 things mentioned below can you do? We 
want you to mark yourself on each one in this way: 

I can do it very well 2 points 

I can do it fairly well i point 

I know nothing about it o 

Now go through the list and give yourself a grade of two points 
on each one that you can do very well; give yourself one point on 
each one that you can do fairly well; and give yourself o on each 
one that you know nothing about at all. 

A perfect score is 20 points. We want to see how many of you 
can get a perfect score. After you finish we will have you stand 
up and tell your score. 

•From Ilartshorne, H, and May, M A, Studies tu Deceit (1928). By per- 
mission of The Macmillan Company, publishers. 
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Begin here. Score 

I. Can you swim^ points 

2 Can you skate on i oiler-skates^ points 

3. Can you diive a cat? points 

4 Can you ride on hoiseback^ points 

and so on for ten questions. 


Then came the test itself. The directions are similar to those 
used in the practice sheet Each pupil is asked to late himself 
2, I, or 0, according to his confidence in ability to answer the 
questions. Some of the questions are (15, p 40). 

Points 

1 Do you know who discovered America^ 

2 Do you know who saved the life of Captain John Smith ^ 

3 Do you know much about the Boy Scouts ? 

4 Do you know u here the Pyramids are? 

S- Do you know how to play checkers^ 

and so on for forty questions. 

Finally a test is given to determine whether the pupil is really 
able to answer the questions which he claims to be able to answer. 
The test is in the conventional multiple-choice form. 

The first five questions are (15, p. 44) . 

1 America was discovered by Drake, Columbus, Balboa, Cook 

2 The Indian girl who saved Captain John Smith was Pocahontas, 
Uncas, Hiawatha, Minnehaha 

3 A Boy Scout must not eat candy, ride horseback, chew gum, 
smoke cigarettes 

4 The pyramids are in Arabia, Palestine, India, Egypt. 

5 The king-row is used in checkers, cards, dominoes, croquet. 

The score on this test is expressed in percentages of overstate- 
ment or understatement This is calculated in the following 
manner. 

Score on Part I — the points which the subject gave himself. 

Score on Part II — (the rights minus 1/3 the wrongs) times 2 
Final Score — the per cent that the score on Part II is less or more 
than the score on Part I. 

Thus: A subject gave himself 50 points on Part I. He had 20 
Items correct, 6 were wrong on Part II His score on Part II was 
20 minus 1/3 of 6, which is 18, multiplied by 2, viz, 36 His final 
score, therefore, was overstatement to the extent of 28 per cent. 
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Cady reports a correlation of 505 between the estimate of 
ability and estimate of knowledge and gives the reliability as 579 

Raubenheimei found a leliability of 76 for his overstatement 
test, which IS raised to 86 by the Spearman-Brown formula for 
the two forms combined Terman reports a reliability of 78 for 
Raubenheimei ’s Overstatement Test 

Cady found his o\ ei statement test to correlate 414 with the 
criterion of incoirigibility Raubcnheimer found biserial r’s of 63, 
47, 48, and 3- between the most stable individuals in normal 
school gioups and delinquent boys in a parental school Cushing 
and Ruch found a biseiial 1 of 13 between noimal and delinquent 
girls Woodiow and Bemmels found correlations of 56 and 43 
between chaiacter ratings and the results of an overstatement test 
for children in kmdergaiten and a nursery school, lespectively. 
Evidently this test has symptomatic value in differentiating be- 
tween children who adjust themselves well and poorly to the social 
life of the school. 

A variation of this Overstatement Test known as the “Books 
Read Test” is so similar to it tliat it will be described here, 
although It does not in reality use the double-testing technique 
This test, which is ascribed by Terman to Knight, claimed by 
Ruch as originating w ith himself, and credited by Raubenheimer 
to Franzen (8), was first published by Raubenheimei The test 
consists of a number of book titles, some of which aic fictitious 
The pupil IS to check those which he has read The score is the 
number of fictitious titles checked, “it being thought that such 
misstatements on the part of the subject might be an index to his 
mental honesty” Hartshoine and May ciiticize this test on the 
ground that it is impossible to differentiate between “errors of 
honest lecognition” and dishonesty Raubenheimer reports, how'- 
ever, that the test which he used wuth ten fictitious titles in each 
form showed reliability of 59 raised by the Spearman-Browm 
formula to 74 for both forms together. The test shows biserial r’s 
of .41, 31, .45, and .37 in differentiating betw'een stable boys in 
normal groups and delinquent boys in a parental school 

The most successful form of the double-testing technique, de- 
vised by Hartshorne and May, is the technique for measuring 
cheating wdiich they have used most extensively In brief it con- 
sists of two equivalent forms of the same test, one given under 
conditions where no theating is possible, and the other under 
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conditions where cheating is possible Usually this has been accom- 
plished by passing out an answei key with each paper the first 
day, telling each pupil to use it to scoie his paper with, but to 
keep It out of sight until he is ready to score, and then giving the 
equivalent form of the test witliout answer sheets on the second 
day 

Given two forms of the test equivalent in difficulty at all levels 
of the scale, it is possible to estimate statistically exactly how 
-much variation between scores on the first and second tests there 
may be without cheating Then if the difference between the first 
and second tests exceeds this figure, cheating may be suspected 
or even asserted as a fact. 

For instance, m one of the arithmetic problem tests which were 
used. It was found that on repeating the test under conditions such 
that there w'as no possibility of cheating either time there was an 
average gain of i 06 points (which we call practice effect), but 
that there was a variability of gains (or losses) wdiich amounted 
to 3 10 points (standard deviation) The chances are less than 3 
out of 1,000 that an error as large as 30 or 9 3 points will be made 
But since the practice effect is i 06, we may expect the second 
score to be lower than the first score by I i minus 93 or 82 
points only 3 out of every 1,000 times Aibitranly assuming this 
chance to be negligible, Haitshorne and May decided that when 
the second score was 8 points or more lower than the first score on 
the arithmetic pioblem test, there was cheating This they took as 
their “fact” score They arbitrarily took a deviation of 2 92 stand- 
ard deviations as the dividing line between the certainty of cheat- 
ing and not cheating and used this unit as the “amount” score. 

Tw'o types of material were used with this technique One type 
of test show'ed very little practice effect and was dependent hardly 
at all on speed Four of these difficulty tests were used as follows: 

1. Arithmetic problems 

2. Completion sentences 

3 Information test items 

4 Vocabulary test items 

Speed tests were also employed using the same technique. 

I. Test of adding two-digit combinations 

2 Number checking 

3. Cancellation of A’s 
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4. Digit symbol substitution test 

5 Making dots m small squaies 

6 Cancellation of a single digit 

In these speed tests cheating may be accomplished by working 
beyond the time limit, so that no key is needed The advantages 
of using these speed tests aie that diey require less time, and since 
they are so unlike oidinary school woik theic is less ovei stimula- 
tion to cheat On the other hand, they requiie more skill to ad- 
minister, and there is a larger practice effect This latter is partly® 
controlled by giving an additional piactice form before cheating 
IS permitted 

The reliabilities of such honesty tests are not high Using the 
power tests Hartshorne and May found the following reliabilities 
(10, II, p. 93): 


Table 61 

Reliability of Honesty Difficulty Tests* 
(from Hartshorne and May) 


Honesty tests 

Reliability 

coefficient 

Arithmetic problems . 

484 

Completion sentences 

48s 

Information 

535 

School tests 

752 

Vocabulary 

Home test . ... 

205 


The mtercorrelations are arithmetic and completion, 454, 
arithmetic and information, 481, and information and comple- 
tion, 450. 

The reliabilifes of the honesty speed tests are (10, II, p. 95)' 


Table 62 

Reliability of Honesty Speed Tests * ** 


Additions 

Number checking 

Cancellation of A’s 

Digit symbols 

Dots 

Cancellauon of digits .. 


424 

322 

502 

36s 

397 

490 


* From Hartshorne, H , and May, M A , Studies m Deceit, 1928 By permis- 
sion of The Macmillan Company, publishers 

**Ibi 4 . 
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Besides these techniques for measuting cheating in school situa- 
tions, Haitshorne and May used other foims of testing to test 
cheating at home, in athletic contests, and in pailor games The 
test of cheating at home consisted simply of gi\ ing one of the tests 
already desciibed, to be done at home Results weie compared 
later with a test taken in school under standard conditions 

The tests m athletic performance consisted of standard strength 
or perfoimance tests in which two trials were given and an oppor- 
tunity to cheat was permitted on one of the tests The tests used 
were 

1 The hand dynamometer for measuring strength of grip 

2 The spiiometei for measuiing lung capacity 

3 Chinning 

4 Standing bioad jump 


As in the school tests m which the duplicate testing technique 
was employed, the possible difference between two trials in which 
there was no cheating was determined, and any difference which 
exceeded this when cheating was allowed was called cheating The 
limits beyond which any difference was called cheating are as 
follows . 


Dynamometer 
Spirometer 
Chinning 
Broad jump 


3 kilograms 
25 cubic inches 
3 times 
7 inches 


The reliability of these tests determined by intercorrelaticJn and 
raised by the Spearman-Biown formula is 772 
Games ordinarily played at children’s parties were also used as 
situations in which cheating might take place and where it could 
be measured The games used were. 

1 A peeping stunt 

2 Pinning the tail on the donkey or the arrow on the target 

3 Bean Relay 

4 The Mystery Man 

Only one of these, the Bean Relay, will be described here to illus- 
trate the method This is a modified potato race, beans being used 
instead of potatoes The races were run off five at a time to simu- 
late a contest Besides the home box in which the beans were to 
be deposited there were three other boxes, the first and second 
with three beans each, and the third with teit or more beans. The 
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object was to cany as many beans as possible one at a time to the 
home box m thirty seconds Observers counted the number of 
trips each child made When the heat was over, the number of 
beans in the ‘‘home box” was also counted and if there were more 
beans than runs this was prima facie evidence of cheating 
The reliability of these tests is not reported 
Tests of stealmg. Four tests were used by Hartshorne and May 
to measure stealing These are. 

I. The Planted Dime Test 

2 The Magic Square Test 

3 The Coin-Counting Test 

4 The Mystery Man (one of the party games) 

In the coin-counting test (which will be described to illustrate 
the method) each child was given a mimeographed sheet contain- 
ing such problems * as (10, p. 93): 

What three (3) coins add up to forty (40) cents ^ 

( ) quarters ( ) dimes ( ) nickels ( ) pennies 

What three (3) coins add up to twelve (12) cents ^ 

( ) quarters ( ) dimes ( ) nickels ( ) pennies 

What three (3) coins add up to thirty-one (31) cents ^ 

( ) quarters ( ) dimes ( ) nickels ( ) pennies 

Bo.\es were also distributed containing the following coins i 
quarter, 4 dimes, 4 nickels, and 4 pennies “The pupils were told 
that It was a money counting test and that, in order to make it a 
real test, the coins were to be used to count with instead of writing 
on the paper.” When they found the right combination with the 
coins they entered the number of coins used on the mimeographed 
sheet These sheets were collected after a certain time limit and 
then the directions were given* “Now put all the money back in 
the box, put the band around it, and we will collect it Pass the 
boxes to the center aisles ” Opportunity was afforded for any child 
to take some of the money if he was so inclined 

To identify the boxes the following device was used A blurred 
space was purposely left in one of the problems and the children 
were instructed to copy the number on the bottom of their box 
in this space Since they signed their names to the sheets, identifi- 
cation of the boxes was effected 
The reliability of these tests has not been stated. 

•From II.Trtsliornc II, and May, M A Studies vi Deceit (1928) By per- 
mission of The Macmillan Company, publishers 



Performance Tests 


315 

Tests of lying. Two tests of lying were constructed, both paper- 
and-pencil tests, one to measure lying to avoid disappioval, the 
other to measure lying to gain approval 

One of these tests to measure lying to avoid disapproval was a 
set of questions of which the following are samples * (10, p 95): 

33. Did you ever cheat on any sort of test^ 

34 Have you ever cheated on such tests more than once^ 

45. On some of these tests you had a key to correct your paper by. 

Did you copy any answer from the key? 

Two scores were given. One was a truthfulness index, whch was 
determined by the number of admissions of having cheated on the 
tests of cheating previously administered. The other was an index 
of lying which was obtained by comparing the answers with the 
actual results on the cheating tests The scoring method is some- 
what complicated but involves a comparison of the answers to the 
questions with actual conduct on the previous test. 

The other test designed to measure lying to win approval con- 
sisted of a number of questions concerning “specific acts of con- 
duct which on the whole have rather widespread social approval, 
but which at the same time aie rarely done ” Examples are ** 
(10, p 98). 

4. Do you usually report the number of a car you see 


speeding^ Yes No 

5. Do you always preserve order when the teacher is 

out of the room^ Yes No 

13 Do you usually pick up broken glass in the street^ Yes No 

29 Do you read the Bible every day^ Yes No 


To determine the critical point that divides honesty from lying 
the investigators had a graduate class answer the questions in a 
way that would represent their childhood. Then a point differing 
by three standard deviations from the mean of each test for this 
adult group was taken as being the critical point a score above 
which indicated the presence of lying. 

Another type of deceit which seems to be neither cheating, 
stealing, nor lying is that of failing to report what is credited to 
one in error This partakes of all three — flying, stealing, and cheat- 
ing. Miller (12) tested this by planting errors in the scores on 

•From Hartshorne, H, and May, M A, Studies in Deceit (1928) By per- 
mission of The Macmillan Company, publishers 

**Ibtd. 
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papers returned to students enrolled in the summer session of a 
large uni\cisity The paper of c\eiy oihei student was correct, so 
that there i^cahl he little lilcehhood of aiousing suspicion Students 
were asked to chock the maiks which they received The results 
indicated that fewer students icportcd the error in scores which 
were too high than m scoies which were too low This technique 
does not peim.L fixing the guilt on any ndixidual student, as it is 
impossible to separate intention from carelessness or negligence. 

All m ail, the tests for measuring deceit devised by the Char- 
acter Education Inquiiv pioxtde a very comprehensive means 
for survex mg the xarious txpes of deceitful behavior in sex'- 
eral different situations. So thoioughly has the work been done 
that the techniques xx-ill probably not hax'e to be added to for 
many years When the thirty-two different tests proposed are 
lexiewed, it is seen that their administration is cxpensix’e, both in 
time and in monev For this reason alone if for no other, the use 
of tests of this kind xxill probably always be restricted to experi- 
mental xx’ork xvhere adequate funds are axailahlc 

All of the results reported by Hartshorne and May in validat- 
ing their tests cannot be repeated here. Three tables xvill be given 
shoxxing the reliability of types of deceptive behavior studied, the 
intercorrelation betxxeen types of deceptive behavior, and average 
correlation betxx-een single tests of different techniques * 


T.xble 63 

Reiixbility of Txtes of Deceptix-e Behavior 
(from Hartshorne and May, lo, II, p. 122) 



Types of Jeceptwn 

Reliability 

I 

The classroom txpe of cheating 



A Copying from a key 

871 


B Adding more scores after time b called 

8zs 


C Peeping 

721 


D Faking the solution to a puzzle 

7 SO 

II 

The out-of-classroom types of cheatmg 



E. On home xvork 

. .240 


F On athletic contests, faking a record 

•772 


G In parties faking, peepmg, stealmg 

Not known 

III 

H The stealing 1376 of dbhonesty 

Not known 

IV 

The Ivmg t>-pe of dishonesty 



I Lx'ing for approval 

.836 


J Lying to escape disapproval 

Not known 


•From Hirtsho'ne H and May, M A, Studies in Deceit (1928) By per- 
mission of The Macmillan, Company, publishers. 
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T\nn 6j. 

Intescorrh-Uiovs or Nine ’1\i-ls or Di-chtivf Bfjiavior 


The measures of anv t\pL ici're-.ciit the loniposite of all tests used 



(from Hartshorne and M?/, 

10, II, 

P 123) 




B C 

D 

E 

F 

G 

H 

I 

A 

450 400 

400 

172 

2B8 

118 

143 

350 

B 

374 

42s 

193 

34S 

169 

173 

.248 

C 


300 

234 

100 

250 

200 

.108 

D 



X 

300 

122 

346 

.256 

E 




142 - 

-01s - 

■010 

400 

F 





118 

283 

230 

G 






210 

— 004 

H 



Table 65 




.132 

Average 

Intercorrelations 

BETWEEN Single Tests of DirrrRFNT Techniques 


(fiom Hartshorne and May, 

10, 11, 

p 212) 




B C 

D 

F 

H 

I 



A 

292 2S5 

291 

198 

127 

312 



B 

219 

iSS 

194 

128 

254 



C 


196 

062 

160 

161 



D 



.184 

283 

208 



F 




162 - 

-003 



H 





.132 



A progressive lowering in 

reliability may be seen as 

one goes 

from a 

Situation that is 

very similar to one 

that IS 

different. For 


instance, the aveiage correlations with a single test of copying 
fiom a key are as follows' 

One test of copying from a key with another test of copying .696 
One test of copying fiom a key with one test of adding mote 

scores after time is called (both school tests) 292 

One test of copying fiom a key with one contest (the latter 
being outside the class-ioom) 198 

As the situation presents features which differ the response 
varies, or to quote Hartshorne and May, “an individual behaves 
similarly in different situations in proportion as these situations 
are alike ” The conclusion from this is that a single test of deceit 
has little symptomatic value of deceit in general The value of a 
single test as an honesty test depends on the degree to which the 
single situation has features which are common to all the situa- 
tions in which honesty may be displayed. In applying this rule one 
must remember that few individuals resfiond to the abstract 
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features of a situation — to the honesty factor, for instance — but to 
such concrete things as the teacher, pencil, paper, school marks, 
coins, stamps, liJcets, social stimulation, case with which deceit 
may be accomplished, etc A test which employs one set of these — 
paper, pencil, class-room, teacher out of the room, pupils taking 
a test, knowledge that the test counts for promotion — may be 
quite dilfeicnt so far as incentives to dishonesty go from a situa- 
tion that involves a contest, spoons, boxes, beans, running, ob- 
servers with pads and pencils We should expect a test of cheating 
in one situation to give results similar to a test in the other 
situation only to the degiee that pupils recognize (respond to) 
some element common to both 


Tests of Suggestibility 

The precise bearing of “suggestibility” on conduct is uncertain 
However, the matter is woithy of investigation, if only to point 
out the negative effects of suggestibility Whipple, in his manual 
of Mental and Physical Tests, summarized in his usual complete 
fashion the work that had been done with tests of suggestibility 
up to the time that he wrote (1915). He describes these tests as 
follows (29, p 222) : 

“In them (tests of suggestibility) the experimenter seeks, by 
suitable arrangement of the test-material or of the instructions, to 
induce the subject to judge otherwise than he natuially would — 
to induce him, foi example, to judge equal lines or equal weights 
to be unequal, or to perceive warmth where there is no warmth, 
etc If the attempt is successful, the subject is said to have 
‘yielded,’ or to have ‘accepted’ the suggestion, if unsuccessful, he 
IS said to have ‘resisted’ the suggestion The degree of his sug- 
gestibility IS indicated by the quickness or frequency of his 
‘yields ’” 

Whipple describes five tests. 

Suggestion by the Size-Weight Illusion 

Suggestion by Progressive Weights 

Suggestion by Progressive Lines 

Suggestion by Line-Lengths by Personal Influence 

Suggestion by Illusion of Warmth 

The most work has been done on the first of these tests, and 
hence it will be described briefly here. Its purpose is to see to what 
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degree big things are judged to be heavier than small things Two 
standard cylindrical blocks and twenty compaiison blocks aie 
placed before the subject 

“Both standaids weigh 55 giams' both are 28 mm. thick, but 
the larger is 82 and the smaller 22 mm in diameter. The 20 
comparison blocks are all 28 mm thick and 35 mm in diameter, 
but the weights range fiom 5 to 100 g by 5 g inciements. 

“Place before the subject the larger standard block and say. 
‘Here is a block I want you to find a block in this series of 20 
blocks that seems to you just as heavy as this one Lift it by 
picking it up edgewise with your thumb and finger like this (illus- 
trate) Then try the first of these weights (at the left) If that 
doesn’t suit, try the next, then the third, and so on till you find 
a block that seems equal to this one Each time you must lift this 
block first, then the one you aie Iryung in the series Keep your 
eyes constantly directed at the weight you are lifting ’ When the 
subject has selected an equivalent weight, the same procedure Is 
followed with the second, or smaller, standard block.” 

The score Is the difference between the weights of the standard 
blocks and the ones selected to be of equivalent weight. Whipple 
reports Gilbeit to the effect that the illusion is well developed at 
the age of six years, apparently increases until age nine, and then 
decreases with advancing age Vaiious investigators have found 
that defective children with mental ages below four fail to get the 
illusion at all Reports differ as to the relative suggestibility of the 
sexes Practice seems, if anything, to increase the illusion. 

Readers are referred to Whipple’s manual for a description of 
the remaining tests 

W. Brown, working later, made very extensive researches on 
suggestibility, using a wide variety of situations including a “least 
perceptible” (imagined) sensation, the perception of change; a 
series of progressive inci eases in weights and lines; memory, 
recognition, and imagination, normal illusions, estimations of 
magnitude, simple esthetic preferences in the matter of properties 
in geometrical figures, and simple esthetic preferences in the 
quality of sensation. 

His conclusions are (23, p. 424) : “There are no individual 
differences which are sufficiently conspicuous to justify the experi- 
menter in calling one person ‘very suggestible’ and another ‘not 
suggestible.’ There are no individuals who have consistently high 
or consistently low Indices of suggestibility through a series of 
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tests On the contran , the experimenter is struck by tlie fact that 
the most skeiitiLal individual will yield at times with suiprising 
readiness to the suggestion, while a peison who has yielded to 
some tests with veiy little apparent lesistance will unexpectedly 
become veiy lecalcitrant ” This statement denying that there are 
tendencies to be litgldy suggestible or the contrary shall be tem- 
pered by a moie exact statement as to the degree of intercorrela- 
tion Blown found 472 mtercorrclates between the different tests 
which he used, the aveiage of which was -|- 143 This leads him 
to make the statement (23, p 430) ‘'It is probable that an indi- 
vidual who is more suggestible than another in one of these tests 
will prove more suggestible in another Yet the actual amount of 
the correlation is so small and the number of negative instances so 
great that the ‘probability’ m the above statement can only be 
very slight While it seems to be true that suggestibility is a trait 
more conspituotisly developed in some individuals than in others, 
yet the individual diffeiences are small and seem to be subject to 
reversal undei the influeni.e of conditions which are not wnthin the 
control of the expe '"'enter 

Prideau (27), wiitmg in 1920, states fiom clinical experience, 
“(a) Suggestibility viics in different persons irrespective of the 
nature of the suggestion and of the suggestor. (b) Suggestibility 
vanes in the same person at different times and under different 
conditions fc) Suggestibility may have reference to a particular 
system of ideas only (d) A person may be suggestible toward 
one person and not towards another ” 

In other words, one test of suggestibility Is practically worthless 
because its lesults are so very specific In order to test an indi- 
vidual for suggestibility, many tests must be used representing a 
variety of situations. But this would add to the time, difficulty, 
and cost of the testing and would defeat its own purpose. 

M Otis (26) attacked the problem later (1924) with a fresh 
approach Criticizing previous work on the ground that various 
types of reactions had been grouped under the name suggestibility, 
she essayed to study “ability to resist a suggestion” by means of 
a group paper-and-pencil test. The test comes m two forms, A 
and B, of forty items each. Only twenty-two m each form may be 
used to measure suggestibility, however, the remaining items being 
innocent items from the Woodworth-Wells Directions Tests. Each 
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pupil IS given a slicei on which he lecoids Ins response, and the 
diiccLions aie lead aloud by the evaminei 
Sample item (26, p 6S) 

Directions lead aloud by 
tea< her 

I. You see here some circles 
Write a woid 111 the third 
circle. (10") 


3 Write the answer to this 
question Do butteillies cat 
bugs or green leaves^ (to") 

5 Here are two squares and 
two circles If the circles aie 
larger than the squares, put 
a dot in the center of the 
smaller square. (Pause ) 

When through with that, 
write the letter a under the 
larger square. 

Otis found correlations of sufficient size (average 56) between 
the two forms of her test to justify her in claiming “There is a 
trait that we may call ability to resist suggestion ” Part of this 
reliability, however, comes via the influence of intelligence, which 
correlated with the tests 77, 72, and 75 on three different occa- 
sions When a group of 100 children having the same mental age 
(90 to 9 ii) were tested, tlie correlation bctucen Form A and 
Form B dropped to 19 So we are forced to dissent from her con- 
clusion that her test indicates the presence of a definite trait, 
“ability to resist suggestion,” which exists apart from the intelli- 
gence involved in the situation. 

Tests of Persistence 

Tests of persistence have been few, and yet the scanty evidence 
which is available indicates that there is something in tests of this 
type which holds considerable promise for the diagnosis of conduct. 

In school work there is a good reason to believe that persistence, 
or sticking to a task, is one of the mam factors that helps to supple- 
ment or compensate for ability Many a dull child has won a 
“passing” grade through the exerase of unutflial diligence. There is 
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also good evidence that some bright children tend to let go of a 
task too quickly when they have done enough to excel and to win 
the praise of the teacher and the appiobation of fellow-pupils 

Persistence is also a piime factor of success in the workaday 
world. The lolling stone has won pioveibial fame for failure 
Studies of successful men, whether in business or in professional 
life, indicate that in every case there is a certain persistence of 
activity that produces the fruits Here again nothing can com- 
pletely compensate for lack of talent, but there is considerable 
leeway in the ebb and flow of aflaiis where sticking to it will cause 
one man to succeed where another man of equal ability but with 
less persistence would fail Furthermore, there is evidence, as will 
be seen in the following pages, that peisistence is also a factor in 
adequate social adjustment 

In 1911 Fernald (32, p 331) reported at a meeting of the 
American Psychological Association a paper entitled “A Kinetic 
Will Test ” He was interested in the problem of testing and 
examining delinquents and in discovering tests which would differ- 
entiate between them and normal youths. He used a strictly 
deductive approach in constructing his test “It is essential to 
include in any comprehinsive group of psychological tests to be 
applied in the classification 01 defectives among prisoners an ade- 
quate test for the function called wtll, since the success or failure 
of individuals depends so largely on the ability to endure and to 
continue to strive for the sake of achievement and in spite of 
fatigue and discouragement ” 

The test makes use of an apparatus which consists of a platform 
on which the subj'ect rests his heels and a dial at the height of the 
eyes informing the subject of the height of the heels from the 
floor The test is to see how long one can stand on tiptoes with 
heels an eighth of an inch or more from the floor. When the sub- 
ject brings his heels to the floor, a bell rings indicating that the 
test IS through The dial also has a dead-line, and if the subject 
gradually weakens, the directions warn him that he must bring 
his heels to the floor when the little black disc passes the mark 
three times. 

Dr Fernald gave the test to Ii6 reformatory prisoners and 
twelve members of the senior class of the Ringe Manual Train- 
ing School of Cambridge, Massachusetts, with the following re- 
markable results: * 
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Median time Lowest score Highest score 
Reformatory group . 14' 54" 2' 30" 52' 45" 

Normal group 36' 12' 2° 30' 6" 

Comments on the result of the test show that in every case the 
decision to “give in” preceded the necessity of it, that varying 
degrees of weight and vaiying degrees of strength neutralize each 
other, that previous training was not a factor in differences with 
respect to the particular function tested Fernald mentions as a 
point in favor of the test that it is independent of language, but in 
view of the doubtful value of other non-language tests for measur- 
ing conduct, this point must be taken with reservations. 

Bronner (30) reports that in later expeiimentation with the 
Fernald test the norm was 50 minutes, making it inconveniently 
large for expeiimentation She tried out another persistence test as 
follows “The subject was given a pair of iron dumb-bells, each of 
which weighed tw o pounds She was told on a given signal to take 
one in each hand and extend the arms level with the shoulders, 
holding the dumb-bell in a horizontal position As soon as the arms 
were dropped about five inches or more, the time score was taken.” 
In trying out this test Bionner found that but three out of twenty- 
six members of a gioup of delinquent girls reached or exceeded 
the median of a group of college students “It would seem that 
the members of Group C (the college group) were much more 
willing to endure physical discomfort foi the sake of a good record 
than were the members of Group D (the delinquent group) Very 
frequently girls in the latter group would remark, ‘Oh, it hurts’’ 
and drop the dumb-bells They seemed on the whole to have much 
less will power and physical endurance, at least in matters where 
there was no necessity for continued discomfort other than mere 
pride in a deed well accomplished ” Both of these tests show such 
remarkable differences between normal and delinquent groups 
that tests of this type are worthy of further trial. However, in 
future work care should be taken that the results do not depend 
on physical strength or special skills, but merely on willingness to 
endure. 

Morgan and Hull (34) has introduced the use of a partially 
concealed maze which is impossible of accomplishment as a test 
of persistence. Not satisfied with a mere measure of time, they 
had observers rate the persistence of subjects on a nine-point 



324 Diagnosing Personality and Conduct 

scale. They decided that the reliability shown by the judges in 
this subjective evaluation of persistence showed close enough 
agreement to wairant further expeumcntation with its use 

Persistence in taking mental tests wheie there is no time limit 
imposed is a natural method of testing persistence Persistence has 
been suggested as a factor in the three-hour Thorndike “Intelli- 
gence Test for High School Graduates,” also in the Thorndike 
“CAVD” tests, but the effect of persistence has never been dis- 
entangled for separate study from the factor of ability. Chapman 
made a preliminary study of persistence in a test of word-buildiiig 
scoieci by total number of woids obtained and total time con- 
sumed He found a considerable positive corielation (average 65) 
between peisistence and success Although this correlation is high, 
Chapman cites evidence to support the belief that it could be even 
higher if the capable children did not give up so easily “It will 
be seen that whereas there are very great differences in speed of 
thought between the slow and rapid groups, the persistence of the 
slow group goes a long way towards compensating for the lack of 
speed ” 

Chapman (31) points out that persistence is extremely sensitive 
to group stimulation Whereas in certain public schools the median 
time of woik was only ten minutes, in a certain parochial school 
at the end of twenty minutes only three pupils out of forty had 
abandoned the task He mentions four factors which may influence 
tests of this type, (a) habits of the pupils, (b) diicctions given by 
and prestige of the examiner, (c) setting of the test, (d) manner 
in which the children weight the importance of the lesult 

Very ingenious tests wcie devised by the Character Education 
Inquiry (33) to measure persistence In the “Cross Test,” the 
familiar and exceedingly difhcult “Japanese Cioss” puzzle, consist- 
ing of wooden sticks which can be made to inleilock in the foim 
of a cross, was used With this was used a puzzle called the 
“Chinese Rings ” Each child being tested was allowed to choose 
which one he would work with first, and the score was the time 
spent working on the first puzzle tried Magic number squares and 
magic word squares were used as puzzles in the same way. 

A different type of measure of persistence was obtained from 
monotonous tests of simple additions. These tests were also used 
in the measurement of cooperation Twelve sheets were added in 
succession, requiring Vork for twenty-four minutes without rest. 
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Six of the sheets counted for each individual and six for the class. 
The peisistence scoie was the diffeience between the scores made 
on the fiist tivo sheets and on the last tuo sheets 

The leliabilities of these tests weie faiily high They were ob- 
tained by con elating the first half of the testing with the second 
half, and then raised by the Spearman-Brown formula to indicate 
what reliability could be expected from the complete testing. 


Table 66 

Reliabilities of Persistence Tests ' 



Reliability 

Reliability 
corrected by 
Spearman-Brown 


coefficient 

formula 

Story resistance 

S9 

7S 

Cross and ring with magic number 
square 

38 

SS 

Persistence for self 

79 

88 

Persistence for class 

8S 

92 

Whole battery .. . . 

80 

89 


But the mtercorrelations show the usual tendency to be very 
low. 


Table 67 

Intekcokrelattons of Persistence Tests •• 




2 

3 

4 

5 

I 

Story resistance 

35 

.16 

•15 

.18 

2 

Magic number squares . 


38 

20 

IS 

3 

Cioss and ring 

, 


.22 

18 

4 

Persistence for self 




42 

5 

Persistence for class 

. . 





Average inter-r 


239 


Cooperation or Service 

Original tests of cooperation for use in the class-room were 
devised by the Character Education Inquiry. Three of these will 
be briefly described. For a complete description, the reader may 
turn to Hartshorne, May, and Mailer’s Studies in Service and 
Self-Control.^ 

• From Hartshorne, H , May, M A , and Mailer, J B , Studies in Service and 
Self-Control (1929),? 330 By permission of The Macmillan Company, publishers 
Ibid 

tThe Macmijlan Company, 1929 
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In the “Kits Test” there were given to each child school pencil- 
boxes containing ten articles dunking cup, pencil sharpener, ruler, 
eraser, pen, penholder, double pencil, and thiee other pencils Each 
child was allowed to give away any number of the articles in his 
kit that he desiicd to help make up kits for poor children who 
had none A table was diawn up weighting the value of the articles 
in inverse proportion to the peicentage given away, and pupils 
weie stored according to this key on the articles they gave away 
In the “En\ elopes Test” four envelopes weie distributed to the 
children and directions were given to find at home and put in the 
envelopes jokes, puzzle pictures, short stories, or a beautiful pic- 
ture if they wanted to help contribute these to children m hos- 
pitals Again a score sheet was prepared giving values for each 
envelope according to the number of items supplied. The score 
was the sum of the credits thus earned 

In the “Money Voting Test” a ballot was prepared for each 
child in a class to vote on how he would like to see prize money 
disposed of The numbers in brackets are the rank order of the 
Items in social significance according to the judgments of the 
experimenters. 

(4) Give all the money to the boy or girl scoring highest on the 
test. 

(2) Buy something for our school, such as bats, balls, skipping- 
ropes, a big picture. 

(3) Buy something for the room, such as a picture, a globe of 
goldfish, some plants 

(s) Divide the money equally among the members of the class 
(i) Buy something for some hospital child or some family need- 
ing help or for some other philanthropy 

The pupils were instructed to rank these in order of their choice 
Each item was given a score of 2 if correctly placed in the ballot, 

T\ble 68 

Intercokkelations of Service Tests * 



2 

3 

4 

S 

I Free choice 

,20 

17 

13 

.20 

2 Efficiency cooperation 


27 

32 

.21 

3 Money vote 



•27 

12 

4 Kits 




12 

S Envelopes 





Average mter-r 

201 





* From Ilaitshorne, H , May, M A , and Mailer, J B , Studies m Service and 
Self-Control (1929) By pbrmission of The Macmillan Company, publishers 
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a score of i if misplaced by one rank, and a score of o if misplaced 
two or more ranks. 

Inhibition 

A set of tests was devised by the Character Education Inquiry 
to measure inhibition. Only three of the devices will be desciibed. 
The reader is referred to Hartshorne, May, and Mailer’s Studies 
in Seivice and Self-Conti ol for complete descriptions. 

In the “Story Inhibition Test” each child holds a copy of 
a very exciting stoiy which is read aloud by the teacher At 
the climax the stoiy continues on the next page The children 
are given the alternative of counting words on the page just read 
m order to make a score on a test, or opening the folders which 
aie stuck together to find out the end of the story A score of 
I IS given if a pupil does not break open the seal. Two stories 
are used 

In the “Safe Manipulation Test” a toy combination safe was 
placed on each pupil’s desk with instructions not to touch it 
until latei when it was to be used, because if any pupil touched 
his immediately he would get an unfair advantage. Then a senes 
of SIX papei-and-pencil tests was given The examiner checked 
the dial of each safe between each test to see if it had been 
touched by the pupil Scores were assigned according to the 
number of times the dial was in a different position on the six 
inspections. 

The “Puzzle Manipulation Test” consisted of a box with vari- 
ous puzzles carefully placed on a puzzle-peg board so that if 
any article was disturbed it could be lecogmzed. As in the pre- 

Table 69 

Intercorrelations of Inhibition Tests* 




2 

3 

4 

5 

6 

\ 

Stones 

.277 

.073 

.320 

050 

.210 


Safes 


500 

256 

001 

•384 

J 

Puzzles 



.307 

177 

.32s 

4 

Pictures 




420 

.240 

5 

Puzzles 





X)IO 

6 

Candy 







Average mter-r 

.237 






• From Hartshorne, H , May, M A , and Mailer, J B , Studies m Service and 
Self-Control (1929) By permission of The Macmill^ Company, publishers 
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vioiis test a box containing the puzz'es was placed on each desk, 
and directions ucie gi\en not to molest it while certain paper- 
and-pencil tests t\erc tiicd Latei the boxes neie lemovcd to 
another room where they were inspected and scored 

The Stoiy Inhibition Test has a total reliability of 65, and 
one manipulation test correlated with another about .50 

Measures of Caution 

“Caution” has been studied by Brown (37) and Manson (38) 
m connection with intelligence tests only It has been observed 
m the taking of objective examinations that there is a twilight 
area between the questions which aie answered with every cer- 
tainty that they are correct and those questions which are omitted 
altogether because the person taking the test is sure that he does 
not know their answers The lesponse made in this twilight zone 
differs according to the way the test is scored If the test is 
scored accoiding to the number of right responses, there need 
be no tendency to show caution, for every correct answer counts, 
whethci obtained by know’ledge or chance But if the test is scored 
by the foimula “right minus tviong,” where an individual is 
penalized by making a wrong answer, the rashness-caution factoi 
IS apt to be exhibited Some persons under these circumstances 
will attempt every item, confident that the vague associations 
which they are able to make in connection with the item will 
lead them to give the correct answci Others exhibit extreme 
“caution” and refuse to answer any items except those of which 
they are certain 

Blown (36, 37) measured “caution” in those tests on the Thorn- 
dike “Intelligence Test for High School Graduates” in which 
the subject is warned that a wrong answer will count off from 
his score because the number of zero and minus scores will be 
counted — that is, the number of wiong answers The argument 
here is that (37, p 46) “the person in whom the caution factor 
is operating at its maximum would most probably refuse to write 
down any answer of which he was not absolutely sure Lack of 
the knowledge that he is right, even when he has a high degree 
of confidence in his answer, would tend to make the extremely 
cautious individual count the item altogether out, while a less 
cautious person would hazard a guess in many cases ” In the 
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Binet vocabulary test and the fiabue completion test, the cau- 
tion index was expressed as the ratio ol number of items wrong 
to number of items tried 

Brown found that theie was a consideiabic coi relation be- 
tween his caution index and intelligence and also some with 
scholarship (.27 and 15 respectively), but that the correlation 
with conduct and the time taken in the test is ‘~r> siiiaJl as to 
be negligible. 

As will be seen in the discussion under the measurement of 
studiousnessj it is difficult to make a satisfactory study of per- 
sonal characteristics by means of test lesults because of the 
statistical unreliability which follows when test scores are com- 
bined by taking differences or quotients One suspects that the 
measures of caution employed by Biown also have a large ele- 
ment of chance which tends to reduce then reliability. 


Speed of Decision 

Another possible confact lending itself to study by testing is 
“speed of decision.” It has been noticed that peisons differ in 
their ability to make a choice or render a decision Some seem 
to be so confused by the relative advantages and disadvantages 
of the issue at question that a decision is difficult Just when the 
balance seems to be surging in one diiection, some other point 
ivill spring into prominence to countei balance it and block the 
issue Other persons, on the other hand, seemingly have little 
difficulty in weighing the issues and estimating their lelative im- 
portance, thus speedily reaching a decision 

In addition to the decision teat in tlie Downey “Will-Teiiipera- 
ment Test” to be reported upon, studies in decision have been 
made by Budges (39), Gibson (ji). Filter (40), and Trow (42). 
Of these studies the one that enlightens us most concerning the 
value of tests of speed of decision is that by Trow. Fie used 
eight tests as follows: 

I. Line discrimination. Ten cards were prepared, each of which 
had a vertical line 100 mm. in height resting on a horizontal line 
which varied in length in the different cards from 95 to 105 mm. 
The subject was instructed to compaie the length of the two 
sections, and the time after exposuie of the card for such de- 
cision was recorded. 
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2. Weight dts crimination Eight standaid weights varying from 
84 to 112 grams weie used, a loo-gram weight serving as a 
standard of companson Weights weic picsentcd m pairs, one of 
each pair being the loo-gram weight, and the subject was in- 
structed to state which was the heavier, the time of making the 
decision being recorded 

j. Spelling Twenty difficult words in spelling were presented, 
eleven coriectly spelled and nine with slight misspellings The 
subject was instructed to indicate by marking with an R or 
a W whether the word was coiicctly or incoriectly spelled 

4 Ethical judgment A blank with twenty questions such as, “Is 
capital punishment ever right'” was used The subjects were in- 
structed to answer each question by writing Y or N, standing for 
yes or no, befoie each question 

5. Belief The Summer list of beliefs was used A sample ques- 
tion IS, “i Is the world becoming better^” 

6. Rating. Names of subjects in the experiment who were 
members of a psychology class were typed on cards Each subject 
was instructed to arrange these cards m order, placing at one 
end the individual in the group judged to be the most self-con- 
fident and at the other end the individual judged to be the least 
self-confident. 

7 Speed of decision This test is taken from the Downey “Will- 
Temperament Tests ” 

8 Finality of judgment. This is also one of the Downey will- 
temperament senes. 

Twenty-seven subjects were given this battery of eight tests, 
each being a measure of the speed of decision. Reliability coeffi- 
cients are not given. Intercorrelations were as follows. 

Table 70 

CoRPELATlONS BETWEEN THE DIFFERENT MEASURES FOR SpEED OF DECISION 



(from Trow, 42, 
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From these figures Trow concludes (42, pp. 541, 542), “The 
lack of what may well be called trait consistency is the conclu- 
sion that IS foiccd upon us by this study of the speed of decision 
. . . It seems clear from the above data that the persons who 
are quick to decide in some cases are slow in others, and vue 
versa. The man who is quick to decide to buy a house might be 
slow to decide which suit to put on in the morning. The man 
who IS quick to decide against being a party to a shady deal 
might be slow to decide whether to fire a young employee caught 
in some petty dishonesty.” The measuie of time taken to make 
one decision is no dependable indication of the time necessary 
to make another decision. 

Filter (40) in a similar study comes to much the same con- 
clusions. When he used performance tests, his intercorrelations 
were very low. When he used a papcr-and-pencil test, his cor- 
relations are considerably higher Of this Filter says (40, p. 314): 

“Correlations between test results are positive and faiily high, 
indicating that individuals who are quick in decisions of one 
kind tend to be quick also in decisions of other kinds. This does 
not constitute a demonstration of high degree of constancy of 
speed of decision, however, as later qualifications show There is 
little likelihood that any one or two tests can be developed to 
measure the trait adequately.” 

Aggressiveness 

Another quality which has atti acted the interest of psychol- 
ogists IS aggressiveness, a Tiait supposedly of particular impor- 
tance in industry. Since salesmen, e\ecutives, and the like ap- 
parently achieve success 111 part by aggiessive behavior, a method 
for appiaising this trait more accurately should be helpful in per- 
sonnel work in industry and in vocational guidance. 

An interesting experimental attack on the problem of the meas- 
urement of aggressiveness was made by Moore and Gilliland (44). 
They define the quality in which they are interested as follows: 
“He is more likely to be vigorous, positive, and masterful than 
the man lacking in this trait; and he is less likely to shrink from 
notice, to avoid argument, to display a lack of ‘nerve’.” 

I. Test of eye control. Of the several tests which the experi- 
menters used one was a test of eye control. It is commonly be- 
lieved that ability to maintain a steady gaze is one characteristic 
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of the man of powei, and that a shifty eye is a sign of personal 
weakness The test consisted of pet forming a somewhat difficult 
senes of mental additions while constantly returning the fixed 
gaze of the instructoi who sat facing him The subject was em- 
phatically instructed that under no circumstances should he let 
his gaze wander from that of the man facing him, as all move- 
ments of the eyes w'ere to alTect his score seriously The number 
of eye movements is recorded as the test scoie Thirteen men 
rated high in aggressiveness were found to make only 41 eye 
movements as compared with 72 such movements foi the thirteen 
least aggiessive 

2 Tear distraction tests Subjects w’ere piactised in the addition 
tests until the piactice effect was apparently negligible The time 
m the eye control test picviously mentioned when five addition 
series were used was then compared with the time for five addi- 
tion series under noimal conditions For the aggressive subjects 
the staring caused an average delay of 4 of a second, for the 
unaggressive subjects it caused an average delay of 3 2 seconds 

A second senes of distraction tests was given using as before 
the five scries of addition tests as material and threatening the 
subject With the expectation of an elecliic shock which was to 
come during or at the end of each of the five senes of additions 
Although told that the shock might be fiom 75 to 220 volts, 
actually it W'^as never more than 75 volts and wms ahvays given 
after a series was completed so that it did not actually inteifeic 
The average shock-delay of the least aggiessive subjects w^as 6 i 
seconds as compared with an average of 2 2 seconds for the 
aggressive gioup 

In a third series a dead snake, -suitably coiled and pinned to 
a cork board, was placed about ten inches in front of the face 
of the subject while he was adding This stimulus caused an 
average delay of 8 2 seconds foi the least aggressive subjects as 
compared with a delay of 4 6 for the aggiessive ones 

Word association tests were also used, but these will not be 
described here. 

Since variability figures are not given, there is no way of 
checking up on the significance of these differences There Is ap- 
parently correlation between the different tests used, but the size 
of the correlation is not stated. 

Gilliland (42) later revised the test and partially standardized 
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It, omitting disti action by electiic shock and by the snake and 
adding a test compaiing writing at the noiinal rate and wTitmg 
as fast as possible The inciease in speed of w'liting is used as 
a measure of aggressiveness Coiiclations bctw'ecn the total meas- 
uie and aggiessiveness and othei factors are intelligence +02, 
scholarship + 02, private lessons m speech + 34, selling ability 
(newspaper advertising) +26 The coiielation with grades in 
piivate lessons in speech were found because aggressiveness is 
commonly spoken of as a characteristic of the forceful and ef- 
fective public speaker Intercorrelations between the different 
tests of aggressiveness aie not given 

Fuither w'ork on measuies of aggressiveness is needed They 
seem to have some diagnostic pow'er but it is not known to what 
degree the tests depend on the special conditions under which 
they are given. 

Studiousness or Effort m School 

In 1920 Franzen (49) devised a measure which he called the 
“accomplishment quotient ” Almost simultaneously Monroe and 
Buckingham, in connection with the Illinois Intelligence Exami- 
nation, proposed a similai measure which they called the “achieve- 
ment quotient ” Later the name was changed to “accomplish- 
ment ratio” so that it would not be confused with other quotients 
such as the intelligence quotient, in w'hich the denominator of 
the fraction is chronological age The accomplishment ratio is 
the latio of educational achievement to mental development If 
a child’s educational achievement is gieater than his mental de- 
velopment compaied wuth othei childien of the same age, it is 
a symptom that he has woiked haidcr than the aveiage child at 
his studies Theie seems to be no way that a child could suipass 
in educational achievement other childien similaily placed and 
of the same mental development except by greater effort or in- 
dustry. On the other hand, if the educational achievement is 
relatively lower than the mental development, a number of factors 
may be the cause The child may have neglected his school work 
or been deprived of educational opportunity, or been affected 
by other matters interfering with his school progress 

In finding the accomplishment ratio, educational achievement 
IS usually measured by the educational age. Educational age Is 
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found from tables showing the average score on a test made by 
pupils of dilleieiiL ages Usually several tests lepresentmg a va- 
riety of school subjects such as reading, aiithmetic, and the like 
are combined in an equitable way before the educational age is 
determined Mental age is similarly formed from the scores on 
an intelligence test by reading from a table the average scores 


made by pupils of different ages. The formula for the AR is 


EA 

MA’ 


A similar constant suggested by Franzen (48) is found by 
subtracting mental age fiom educational age, but this diffeience 
has not found such wide usage as the ratio 

After adolescence is reached and the rate of learning and 
mental giowth slows down, the units mental age and educational 
age become meaningless, and it is not possible to determine them 
except by artificial extensions of the tables of age norms Moie- 
over age loses its potency as a base in determining achievement 
in the secondary school subjects A unit to serve in such situa- 
tions has been proposed by Femgold (47) and Symonds (52). 
This “index of studiousness” is described by Symonds * as fol- 
lows (54, pp 521-524) 


“If It weie possible to eliminate from the achievement test 
scores the effect of intelligence test scores, the result, according 
to the argument and within the limits of test reliability, would 
be the effect of enviionment which we may call e§ort or studwus- 
ness. Looking at the matter in another way, intelligence test scoies 
give a measure of ‘average’ ability, and predict what may be 
expected in the way of achievement If more or less than the 
expected achievement results, the difference may be ascribed to 
greater or less effoit or greater or less studiousiiess 

“The elimination of intelligence may be accomplished in sev- 
eral ways One method is to turn both test scores and intelli- 
gence test scores into ranks and to obtain the difference be- 
tween the ranks This difference between ranks is the measure 
of studiousness desired 

“A second method for obtaining this measure of studiousness 
is to obtain standard deviations and the difference between stand- 
ard deviations. 

“The results using the two methods are in agreement in the 
main The first method using the difference between rank is the 
easier method and is preferred by some for that reason Both 
methods show a regression effect That is, it is extremely difficult, 

* From Symonds, P M, Measuremtnt m Secondary Education By permis- 
sion of The Macmillan Conpany, publishers 
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statistically, foi a pupil with high intelligence to make a plus 
studiousness index, and vice versa T.hat is, indeed, impossible 
in the method using ranks The pupil having a lank of i in in- 
telligence can have no lowci rank than i in achievement, and 
hence the largest difference for this pupil is zcio Likewise the 
lowest possible achievement diffeience for the pupil lowest in 
intelligence is zero This fault docs not occur in the method 
using S D differences and the S D diffeience is the one recom- 
mended 

‘‘This measure of studiousness suffeis fiom a fault common 
to most measures of conduct It diffeientiates between the mem- 
bers of a single class but does not peimiL compaiison between 
members of different classes Only when one or more individuals 
are common members of two groups can a compaiison between 
groups be made Assumptions about equal aveiage studiousness 
of different groups aie fallacious, as so much depends on the mo- 
tivation and the social setting. What is needed in this connection 
are standards on standardized tests which will peimit the com- 
parison between groups Such standards have been determined 
for nine of the best achievement tests that measure high school 
subjects ” 

The advantage of having a measuie of effort or studiousness 
is apparent. Probably no factor has more influence in determin- 
ing school achievement than native ability, but in the long run 
and within limits what is variously called industry or application 
or e^oit or studiousness can add to or subtract fiom what might 
be expected from the average use of ability 

The use of ratios or differences in studying the relative stand- 
ing on two tests has been studied by Chapman and Kelley Both 
find the method open to grave dangers In the first place they 
point out that the order of unieliability of the differences between 
two test scores is considerably larger than the unreliability of the 
tests themselves Chapman (46) has devised a formula which 
gives a reliability coefficient of the difference or quotient of two 
tests. 

Til -p r,3 


Where r ojoa is the reliability coefficient difference between an 
intelligence and achievement test 
Til is the reliability of the intelligence test 
rjs is the reliability of the subject-matter test and Ti, is the 
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correlation between the intelligence test and the subject-matter 
test 

Using the formula on i epresentative data, Chapman finds the 
reliability codlu-icnt of the diUcience between an intelligence test 
and an achievement test to be veiy close to zeio 

Kelley (50) independently attacked the same problem and has 
given us a foimula for the leliability of the difference between 
the scores that an individual makes on two sepaiate tests First 
the two scores must be transmuted into standaid scoies and then 
the reliability of the difference may be detei mined by the formula 

Oi = ‘\/ T. — rn — I211 

Kelley in addition to pointing ouf the unieliability of the dif- 
feiences between scores on intelligence and achievement tests has 
also discovered that an intelligence test and a general achieve- 
ment test measure so nearly the same thing that diffeiences be- 
tween them are all but meaningless Kelley says (51, pp 2i, 22), 

“On the average, in the neighboihood of 90 of the capacity 
measured by an all-iound battery scoie — reading, arithmetic, sci- 
ence, history, etc — and of the capacity measuied by a geneial 
intelligence test is one and the same If a compichensivc educa- 
tional achievement test and a gcncial intelligence test each give 
‘fairly reliable’ total scoics, each uould need to be more than 
ten times as long to jield equally leliable measures of diffcience 
between the educational achievement and the intelligence scoies 
This IS true not only because 90 pei cent of the tests measure a 
common function, but also because the chance factois cnteiing 
into this 90 per cent of each test tend to obscure whatever real 
difference is being measured by the 10 per cent This means that 
a scant one tenth of the tests are involved in the measure of 
difference, and, piactically, that judgments of individual differ- 
ences between intelligence and achievement based upon the com- 
monly available tests are quite unsound, being of an older of 
accuiacy not of the total scoies of the tests, but of total scores 
of tests less than one tenth as long The possibility of making 
sound judgments of this sort by utilizing much moie refined 
measures lies before us.” 

Thus as the matter stands the theory underlying the accom- 
plishment ratio or the studiousness index is attractive, but the 
actual intelligence and achievement tests which we now use give 
measures which do not possess the differentiation necessary to 
make the use of these derived scores satisfactory. 
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The Downey Will-Temperament Tests 

The chapter or section of a chapter in the history of psychol- 
ogy which will deal with the Downey “Will-Temperament Tests” 
should be at the same time pathetic and amusing — pathetic be- 
cause of the way in which gullible testers rushed in to use an 
mstiument for which much had been promised, and amusing 
because of the politeness with which it was treated by scientific 
woikers who had evidence of its woithlcssness Spurred on by 
the remaikable success attending the development and use of 
intelligence tests in the United States Army, psychologists after 
the World War began confidently casting about for techniques 
with which to measure other aicas of pcisonality When Downey 
in 1919 announced a test of will-tcmpcrament, it was hailed en- 
thusiastically and given willing trial by a large numbei of in- 
vestigators The name of the test itself was intriguing Many 
psychologists accepting at face value Miss Downey’s claims, made 
collections of “profiles” and attempted to inteiprct them Others, 
more skeptical and experimentally minded, put the will-tempera- 
ment tests to the usual tests of leliability and validity No other 
test desciibed m this book has had such widespread investigation, 
cairied out on a veiy high level of painstaking accuracy But 
from the very start the letuins were discouraging, and they have 
consistently told the same stoiy Something is the matter with 
our scientific educators wdien they will icmain so courteous and 
so hopeful in the face of evidence so convincing Freeman, for 
instance, concludes after reviewing the tests and the work which 
has been done upon them (69, p 105), “It is evident from the 
facts which have been presented that the Dowmey Will-Tem- 
perament Test, which is the most carefully standardized and 
most highly elaborated personality test which has yet been de- 
vised, IS unsuitable for wudespiead routine application in the 
school. It is still m the experimental stage ” Must we always re- 
main in a state of hope and expectancy concerning measuring 
instruments which yield poor results on preliminary trials'* Even 
such an intrepid investigator as May timidly concludes (78, p 39), 
“The results of all those attempts at validation by the rating 
method are uniformly ambiguous Nearly all the correlations are 
low Does this mean that the tests are not valid, or the ratings 
unreliable, or tliat this is not the proper* method of testing the 
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tests? An examination of the data reveals the fact that these 
ratings are about as icliable as ratings usually run Other kinds 
of tests have been validated by this method We venture the 
assertion that after ample allowance has been made for eriors in 
ratings the corielations will still be low” Aie we never to be 
able to call a spade a spade ^ 

In 1919 a bulletin from the University of Wyoming appeared 
entitled “The Will-Profile' A Tentative Scale for Measurement 
of Volitional Pattern” by June E Downey (S 9 )- Herein was de- 
scribed a senes of tests, based largely on conti oiled handwriting, 
purporting to measure qualities of will and tempeiament. Miss 
Downey \trote hei doctor’s dissertation in 1908 on Control 
Processes tn Modified Handwriting in which she described ex- 
periments in handwriting changes under various conditions, such 
as when the writer is blindfolded or attempting to do something 
else at the same time, or when there is a time-lapse between the 
directions for wilting and the actual wilting In previous work she 
had practised mustle-t eading and had become remarkably skilful 
in responding to the involuntary cues provided by psychologi- 
cally unsophisticated guides who concentrated on concealed ob- 
jects These studies led her to iccognize diffeiences in “motor 
impulsion,” ‘Resistance to opposition,” “motor inhibition,” “co- 
ordination,” and ‘‘perseveration” She says, 

“I discovered it was a simple matter to select exceptionally 
good guides for demonstrations in muscle-reading by a prelimi- 
nary trial of writing under distraction of attention The ‘good 
guides’ [those whom AIiss Downey could easily follow by muscle- 
reading to the concealed object] under such distraction produce 
an enlarged and lapid hand Extensive observation has thoroughly 
convinced me that this magnified and semi-automatic writing pro- 
duced by the simple device of writing when the attention is par- 
tially distracted by a concurrent process distinguishes the im- 
petuous individual for whom the motor discharge takes place 
easily and readily Writing, decreased in size or gieatly retarded 
m speed, appears under the same circumstances for those indi- 
viduals who ‘hold on to themselves painfully,’ who do not yield 
‘readily to automatism,’ who respond with increased effort of at- 
tention to the demand to handle a double process. The latter are 
the inhibited, obstructed, pondering type of individual.” 

Out of such observations were born the will-temperament tests. 

The onginal Dowuey Will-Temperament Test published in 
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1919 was an individual test In 1921 it was published by the 
Woild Book Company, and in 1922 the World Book Company 
also published a gioup Will-Temperamcnt Test At about the 
same time (1920) these tests weie used in the peisonnel investi- 
gations under way at the Bureau of Personnel Research of the 
Carnegie Institute of Technology in Pittsbuigh and a foim known 
as the “Carnegie Adaptation of the Will-Temperament Tests” 
was there developed by Ream. In 1925 a “Non-Verbal Will- 
Temperament Test” was developed by Downey and Uhr- 
brock 

In the original test there were twelve tests divided into three 
groups. The first gioup, composed of four tests, is designed to 
measure the “fluidity” or speed of 1 espouse and to determine 
whether an individual is of the speedy, “hair-tiigger,” fluid type 
or the cautious, deliberate type. The second group, of four tests 
measures fotcefuhiess and decisiveness of action Individuals are 
here thought of as varying from those who react in a forceful, 
decisive, determined way to those who are indecisive and easily 
led or controlled The third group of four tests measure care- 
fulness and persistence of reaction — ^the ability to attend to details 
and to continue a task for a long time 

Each of the tests will now be briefly described. 

I. Speed of movement. In this first test the subject writes the 
words “United States of America” at his ordinary speed As the 
test is given to individuals, the score is the time taken to perform 
the writing In the group test there is a time-limit of twenty 
seconds, and the score is the number of letters written 

2 Freedom from load The significance of this test would per- 
haps have been clearer if it had been called “freedom from 
inhibition ” The theory underlying the test is that some people 
habitually write near their maximum speed, while others habitu- 
ally write at a speed considerably below their maximum The 
explanation given for this is that these latter persons are subj'ect 
to “load” or “inhibition.” For this test the subject is instructed 
to write “United States of America” as rapidly as possible and 
(in the group test) as many times as possible from the signal 
to begin to the signal to stop. The examiner is' told to suggest 
speeding by voice and manner in the group test. This test is 
scored in the individual test by counting the seconds required 
for writing, or m the group test in the number of letters written 
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in the twenty seconds allowed, and then in both cases taking 
the ratio betneen speeded and noimal nritmg 

5. Flexibility. In this test the subject is instructed to write 
‘^United States of Amci ica,” changing the style of writing as much 
as possible “so that none of your fiiends nould know it” This 
test IS scored o, i, or 2 accoiding to the degree to which the 
subject IS able to change his style of writing as may be judged 
by comparing it with a scale of specimens This test is supposed 
to measuie either the dramatic or histrionic type of temperament 
or the exercise of ingenuity 

4. Speed of decision. In this test the subject is given twenty- 
two pairs of opposite traits (in the group test, thirty) and is 
asked to check the trait in each pair which characterizes himself 
The score is the speed with which the j'udgments are made (either 
the time taken to check the whole list in the individual test, or 
the number checked in forty-five seconds in the group test) The 
purpose of this test is to determine the speed with which a 
person makes j’udgments of this kind. 

In the next group of four tests we have 

5. Motor impulsion The subject is requested to write his name 
in his usual manner, then with his eyes closed, then with his 
eyes open at the same time counting by threes, thus, 3, 6, 9, etc ; 
and finally to write his name nhilc counting by twos In the 
group test the order of events is writing the name (a) with eyes 
open, (b) with eyes closed, (c) with eyes fixed on a pencil held 
by the examiner, at the same time counting aloud the number 
of times the pencil is tapped on the table, and (d) writing the 
name and at the same time keeping tiack of the number of 
times the word fly is spoken by the examiner when he reads a 
list of words that rime with fly, such as die, sigh, and he This 
test is scored by reference to a table which takes into account 
both speed and size of writing Miss Downey’s previous researches 
showed that some individuals who scoied higher on “motor im- 
pulsion” or “muscular tension” than average tend to speed up 
and enlarge the handwriting when working under distraction. 

6 Reaction to contradiction. Early in the test the subject is 
asked to choose between two envelopes. Later the subject is asked 
to state which envelope he chose, whereupon the examiner con- 
tradicts him and denies that that was the envelope chosen The 
subject IS scored high*' if he persists in maintaining that he was 
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correct in stating which envelope he had previously chosen He 
is scoicd low if lie weakly gives m and admits that he was 
mistaken. 

7. Resistance to opposition. In this test a small obstruction 
(such as a pasteboard box) is placed in front of the pen-point, 
exerting enough pressure so that to continue writing will require 
considerable effoit The subject is told to write his name with 
his eyes closed The test is scored qualitatively by comparing the 
behavior “resisting opposition” with that described in a scale. 
Since this test does not lend itself to group testing, it is not in- 
cluded in the group scale 

8 Finality of judgment The pairs of traits on which the sub- 
ject earlier lated himself is again presented to the subject, and 
he is given the opportunity to make any changes he wishes to. 
The score is the time consumed in icchecking, the assumption 
being that the longei time consumed in icchecking, the less satis- 
fied the subject is with the original rating. 

The two tests “Reaction to Contradiction” and “Resistance 
to Opposition” do not lend themselves to group testing, and in 
the group foim hate been replaced by two other tests One of 
them, called “Self-Confidence,” consists of sixteen true-false 
items which concern a list of ten nords read by the examiner 
earlier in the test The items of which the subject is absolutely 
sure he is instructed to underline, and the score on this test is the 
number of statements so undcilined A second test is called “Non- 
Compliance ” In this the subject is scored according to the num- 
ber of changes he makes in his answers to a true-false test when 
told that eight of the sentences aie true and eight are false. These 
are intended to seive approximately the same purpose as the 
two tests for which they are substituted in the individual 
test 

p Motor inhibition. Here the subject Is told to write “United 
States of America” as slowly as possible. The score is the time 
taken. In the group test the subject is Instructed to move his 
pencil as slowly as possible along a dotted line, and the score 
is the number of units traced in a given time. This test requires 
a great deal of self-control, and many persons become very irri- 
tated at the task. 

10. Interest in detail. The subject is instructed to copy some 
handwriting printed in the test booklet. This is done twice, once 
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as rapidly as possible and again at a normal rate, but copying 
as exactly as possible The score is a combination of the differ- 
ence in speeds and ths dcgiee lo which the model is approximated. 

11. Cooidviation of impulses. Ileie the subject is instructed to 
write the woids “United States of America” on a line a little 
over an inch long He is told to write veiy rapidly and to take 
care not to run over the line In the group test this test is scored 
by the number of letters omitted oi which run over the line In 
the individual test the degree to which the time approximates the 
time for normal writing is also taken into account 

12. Volitional peiseveration In the test for flexibility in which 
the subject attempts to disguise his handwriting he is given a cer- 
tain amount of time for practice In the individual test he is 
scored according to the length of time he elects to spend in prac- 
tice. In the group test six minutes of piactice is allowed alto- 
gether. During this time the examiner is copying the numbers 
5, 10, 15, etc , on the blackboard at the elapse of so many sec- 
onds, and the subject is instructed to copy in his test booklet 
the last number wiitten on the blackboard as he looks up from 
his practice, thereby giving a record of the time. 

In summary, the will-temperament tests attempt to* determine 
fundamental character istics of reaction in an individual, using 
handwriting as the testing medium Handwriting is con- 
venient for the purposes of testing and is also a semi-automatic 
response which seems to exhibit personality differences when 
under the influence of various distractions or directions If the 
tests actually do what is claimed for them, they should be very 
valuable instruments for diagnosing important features of per- 
sonality and should find immediate usefulness in personnel work 
m industry, education, and medicine. 

Downey instructs that a total score is not to be found for 
the test as a v/hole, inasmuch as each separate test measures a 
different quality, and a total score would accordingly be mean- 
ingless. Rather she suggests plotting each score in a “profile,” 
so that an individual’s relative position in each test may be 
seen. 

The first item in which we are interested in the study of the 
value of the tests is the reliability of each separate test. Several 
studies of reliability of the Downey Will-Temperament Test 
have been made R^iability is difficult to determine with total 
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Table 71 

Reliability Coefficients on the Group Test, Downey Will-Temperament 

Test 


(from Downey and Uhrbrock, 66 , pp 30-35) 


Test 

REPETITION ONE DAY APART 

REPETITION 

ONE 

MONTH 

APART 

Average 

149 

normal 

college 

women 

42 junior 
high 
school 
boys 

37 high 
school 
girls 

136 boys 


I Speed of Movement, 

II-i 

81 

86 

33 

79 

70 

II-2 

80 

83 

79 

89 

83 

VI-i 

73 

82 

62 

SO 

67 

VI-2 

81 

76 

78 

81 

79 

2 Freedom from Load, 

VI-i and 2 

72 

39 

•31 


44 

VI-2 - VI-l 

3 Flexibility Test, VIII 

7 S 

67 

76 

3 S 

31 

62 

4 Speed of Decision, Test I 

63 

64 

.64 

49 

60 

5 Motor Impulsion Test, X-i 

.72 

7 S 

61 

SI 

6S 

(Total letter count) 
Test X-i (av millimeter 
measure) 

89 

89 

.84 

76 

8S 

Test X-2, 3, 4 (av let- 
ter count) 

79 

67 

72 

S8 

69 

Test X-2, 3, 4 (av milli- 
meter measure) 

80 

.84 

.84 

84 

83 

Sum of ratios (perform- 
ance m test X-2, 3, 4, — 
Test X-I, for letter count 
and millimeter measure) 

6 Self-Confidence Test XI 

SI 

09 

S 7 

23 

12 

•32 

7 Non-Compliance, Test XII 

31 

36 

SO 

36 

38 

8 Finality of Judgment, Test 
XIII 

46 

37 

.26 

49 

40 

9 Motor Inhibition, Test, 
III 

67 

44 

.46 

66 

S6 

VII-3 

73 

76 

.68 

63 

•70 

-10 Interest in Detail, Test IX 

42 

66 

.64 

41 

S 3 

II. Coordination of Impulses, 
Test V 

47 

68 

.56 

.38 

•S* 

(score in millimeters) 

1 2 Volitional Perseveration, 
Test VIII-2 

33 

40 

.32 

•34 

•35 

Average 

650 

626 

JS 92 

522 

602 ■ 
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satisfaction on these tests. In the first place, it is unfair to judge 
reliability on the basis of repetition of many of the tests, for 
memory would so opciate as to vitiate the results. In the case 
of the character rating oi length of time of practice in disguising 
one’s handwiiting, memory would prevent the correlation of the 
two tests from being a true measure of reliability On the other 
hand most of the tests cannot be split into halves. Downey and 
Uhrbrock ( 66 ) and later Uhrbrock ( 97 ) piesent correlations on 
the repetition of the test which are the best that we have to 
serve as reliability coefficients. 

These correlations are highest where the test score is a measure 
of speed, where the skill tested is one well practised, where it is 
objectively measured, and where larger amounts of material 
make up the test, or where there are longer time limits. The facts 
show that the average reliability is greater for adults than for 
junioi high school pupils, and that the correlations are higher 
when the interval between tests is a day than when it is a 
month, but it is not possible to generalize on these two points. 

The reliability of the tests is not uniform, being considerably 
higher in some cases than in others. However, the reliability for 
a considerable number of the tests is sufficiently high so that 
the major criticism of the test cannot rest on this point. 

The second critical check which may be applied concerns the 
intercorrelations of the tests one with another These intercor- 
relations should be particularly significant within each of the 
three groups comprising the test batteiy, for the four tests in 
each of these groups are supposed to measure somewhat the 
same thing. The most comprehensive report of intercorrelations 
(Uhrbrock) contains tables which cannot be repeated here 
in detail. One will be given to indicate the nature of his 
results. 

Uhrbrock^ beside trying out the Downey Will-Temperament 
Tests, tried out many other tests which ate called by the same 
name as the Downey tests. For instance, besides the test of speed 
of handwriting used by Downey, he tried out a variety of other 
tests of speed of movement such as tapping with right hand, 
tapping with right foot, tapping with left foot, reading simple 
piose rapidly, color-naming, etc For each of these he gives inter- 
con elations, correlations with the Downey test, and correlations 
with a composite of 411 tests used Similar work was done with 
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other functions In the following table average correlations are 
derived from correlations given by Uhrbrock 

Table 73 

Average Intercorrelation of the Downey Will-Temperament Tests 
(from Uhrbrock) 



Jfo of 

Average 


correlattons 

correlation 

Speed of movement 

9 Downey tests 

36 

+ 26 

12 miscellaneous tests 

66 

-i- 21 

9 Downey tests with 12 miscellaneous tests 

96 

-t" 08 

9 Downey tests with composite 

8 

+ 36 

12 miscellaneous tests with composite . . 

12 

+ 48 

Motor inhibition 

5 Downey tests 

10 

+ 34 

3 miscellaneous tests . 

6 

+ 48 

4 Downey tests with 3 miscellaneous tests 

12 

-b 20 

4 Downey tests with composite 

S 

+ SO 

3 miscellaneous tests with composite . . 

3 

+ S6 

Speed of decision 

S Downey tests ... 

10 

+ 34 

2 miscellaneous tests 

I 

+ S1 

5 Downey tests with 2 miscellaneous tests 

10 

+ *S 

5 Downey tests with composite . . 

5 

+ S 9 

2 miscellaneous tests with composite . 

2 

+ 60 

Freedom from load, 4 Downey tests 

6 

08 

Fletibility, 3 Downey tests .... 

3 

.11 

Motor impulsion, 4 Downey tests 

6 

25 

Self-confidence, 4 Downey tests 

6 

— 02 

Non-compliance, 3 Downey tests 

3 

OS 

Finality of judgment, 3 Downey tests 

3 

19 

Interest in detail, 3 Downey tests . . 

3 

37 

Coordination of impulses, 4 Downey tests . 

6 

26 

Volitional perseveration, 4 Downey tests . . . 

6 

08 


Ruch (88, 89) in an earlier study obtained the intercorrelations 
within each of the three groups of tests. The average intercor- 
relation for the four tests designed to measure the “hair-trigger” 
type of response was — 005 ; for the four tests supposed to in- 
dicate the wilful, aggressive type, -I-.04; and for the tests which 
purport to measure accuracy and tenacity, -(- 06. 

If one may summarize on the basis of these averages, the inter- 
correlations are relatively low, so low indeed, that even in the 
Downey series one test cannot be said to measure the same thing 
as another test bearing the same name. The correlations with 
the composites indicate that the separate tests measure the true 
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function (if there be one) of which each test is a representative 
with correlations ranging from 40 to 60 

One must conclude in the first place that a serious error has 
been made in labeling the tests in the Downey series with general 
names such as “speed of decision” or “motor inhibition ” The 
names are exceedingly technical and are not those that are in 
common usage even among psychologists. Consequently, it is 
impossible for most persons either to interpret the results of a 
profile so as to gather its significance, or to comprehend the 
terms used so as to recognize them in the person whose profile 
It is. “Freedom from load” means little to most persons until 
they have tried the test itself and see exactly what function is 
being tested But this obscurity in the names of the tests is not 
their most serious criticism The fact that tests bearing the same 
name correlate so low with each other is evidence that each 
test IS measuring something very specific and that it is not 
possible to make inferences from the test results as to the per- 
sonality of the individual. 

This criticism of itself is sufficient to discourage the use of 
the Downey Will-Temperament Tests in any practical situation 
and to cast disci edit on the value of any results obtained from 
their use If, then, interpretations from the test results go no 
further than the tests themselves, of what value can they be? 
We are not directly interested in a practical way in how fast 
one can write or how slowly one can write, or whether one is 
able to disguise his handwriting Data on these points are valuable 
only insofar as they indicate similar trends in the personality 
under circumstances that bear resemblance to the test situation. 
But the results show that this consistency is absent, and that a 
person may show speed in judging the taste of food or the height 
of a building and at the same time be slow in ascribing qualities 
to himself 

A third method of measuring the validity of the Downey Will- 
Temperament Tests is to determine the correlation between the 
tests and ratings on the qualities measured by the tests. Several 
studies have been made in which estimates of the qualities were 
obtained on groups of subj'ects to whom the tests had been given 
Several psychologists, including Dr. Downey herself, have pointed 
out that these comparisons are not strictly fair, for such rea- 
sons as the well-known inaccuracies in the rating method, the 
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% 

difficulty in interpreting the names by which the Downey tests 
are knoun, and the difficulty in perceiving these qualities in 
otheis Hoivcver, these expeiimcnts have been earned through in 
good faith, and certainly the Downey tests ought to be subject 
to the same checks that aie used on other tests 
The correlations obtained betw'een the Downey Will-Tempera- 
ment Tests and ratings of the same traits are given m the fol- 
lowing table: 

Table 74 

Correlations between Scores on the Downey Will-Temperament Test 
AND Rating for the Same Traits 

Ruch and 



Ruch <“> 

Ruch 

DelManzo^’^ 

Downey 

faculty 

student 

pooled estimates 

JFtll-Tempera- 

estimates on 

estimates on 

on 146 high 

ment Test 

20 students 

20 students 

school students 

A 

02 

43 

Y1 

B 

— 09 

27 

02 

C 

4 S 

17 

30 

D 

— ca 

S 3 

-29 

E 

SI 

SO 

10 

F 

.23 

37 

— 03 

G 

•35 

28 

— 04 

H 

— 11 

.07 

17 

1 

— 33 

— 23 

— 09 

J 

IS 

—.07 

St 

K 

— 09 

OS 

S 3 

L 

—.26 

OS 

17 

Average 

07 

20 

- 14 


Meier 

Herskowitt 



estimates 

■mter-ratmgs of three 

Downey 

by three judges 

groups of persons, 


Wdl'Tempera- 

on 106 high 

7, 8, and p persons 

ment Test 

school students 

in each group 

Average 

A 

.21 

— 03 

.20 

B 

07 

.iB 

09 

C 

.11 

.12 

23 

D 

19 

— OS 

.07 

E 

.10 

21 

28 

F 

•05 

.10 

.14 

G 

.14 

.18 

18 

H 

.14 

•35 

.12 

I 

.24 

.29 

— 02 

J 

.21 

00 

.16 

K 

07 

.28 

17 

L 

■03 

— H 

—.03 

Average 

.13 

.12 

•13 
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These correlations with ratings are consistently low and indi- 
cate that there is little relationship between the traits as meas- 
uied by the tests and as estimated by careful observers. 

With the mam lines of evidence in hand we need go no further 
in order to draw conclusions as to the value of the tests How- 
ever, before doing so we wish to refer to two other types of 
experimental check worth recording One bit of evidence pre- 
pared by Downey (6i), fourth among the experimental checks, 
was the ability to recognize the person whose profile is under 
scrutiny A group of judges was given twelve profiles and a list 
of the names of the persons to whom these belonged The task 
was to match the name with the profile (6i, p. 286). “Correct 
identification of profiles ran from o to 5 out of 12, or from total 
failure to identify any piofile (i judge) to 41 per cent of suc- 
cessful identification (2 judges). The percentage of successes for 
the total of 144 judgments (12 judgments by each of 12 judges) 
was 22, where chance success would be less than i per cent ” 

These results must be considered unsatisfactory. At another 
time Downey (61) submitted profiles in groups of three with 
instructions to select the profile that best fitted a given person. 
In series A the piofiles weie similar, while in series B they repre- 
sented contrasting types. The average success of judges in series A 
was 33 3, 444, and 51 3 per cent, while in series B they were 
78.7, 719, and 586 per cent. Certainly this is not a high per^ 
centage of successes and would hardly testify to the practical 
usefulness of the tests. Downey explains the failure partly on 
the ground that not only were the trait names partly ambiguous 
and obscure, but that the underlying temperamental traits them- 
selves are difficult to detect by suiface observation. 

The sixth and last test of the validity of the tests is the 
degree to which they differentiate persons already known to differ 
according to race, sex, social adjustment, etc The tendency among 
experimenters to record every difference, however small and how- 
ever statistically insignificant, warns one not to accept published 
statements as to these differences without a challenge. 

Bryant (56) found a difference of 15.5 points between delin- 
quents and non-delinquents. May (78) estimates the P. E of 
this difference to be 2 5, making the difference significant Dow- 
ney says of this, “This scale has evidently hit upon two or 
three of the more striking differences between delinquents.” 
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Wires, studying sixty-seven psychopathic patients, concludes opti- 
mistically, “There weie no instances in which a score as indi- 
cated on the profile was noticeably opposed to the personality 
of the subject as shown by the history or analysis ” Bryant de- 
scribes her delinquents as wilful, aggressive, and possessing a 
greater tendency toward accuracy and tenacity than toward 
adaptability Wires describes her group as “impulsive, poorly in- 
hibited, with relatively high scores on impulsion and assurance, 
and lowered scores for resistance and inhibition ” 

McFadden and Dashiell (79) have studied white and negro 
high school and college students and have found differences in 
almost all of the will-temperament tests between Indians and 
whites Freyd (70) discovered that certain of the tests showed 
reliable differences between socially inclined and mechanically 
inclined groups. 

It IS difficult to see how two forms of the same test could give 
such low correlations and yet at the same time be such potent 
instruments for diagnosing differences between groups. Peihaps 
one should accept the evidence at its face value until repetitions 
of the experiments verify or fail to verify the findings 

The tests have also been used to study the discrepancy be- 
tween scholarship and intelligence. Poffenberger and Carpenter 
(81) studied two groups, one the “success” group with higher 
rating for school success but lower ratings for intelligence, the 
other “failure” group with lower ratings for school success and 
higher ratings for intelligence Differences between the groups 
were noted on the will-temperament tests Tests were tried in all 
possible pairs, from which thirty-one pairs were selected as pos- 
sessing differentiating ability When combined they gave an aver- 
age of plus scores of 9 4 to the success group and 3 S to the 
failure group There was also an average of minus scores of 3 6 
for the success group and 9 9 for the failure group. To the pres- 
ent writer this seems like a grand capitalization of chance dif- 
ferences. It would seem as though all the chance differences ex- 
isting in this particular set of data were ferreted out by careful 
statistical treatment and then summated to present a picture of 
the total possible effect of chance differences. Studies of this type 
need to be repeated before we can know to what degree the 
differences are real. 

Uhrbrock (97) firds the average correlation between a test 
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of intelligence and the will-temperament tests taken singly to 
be -(-08. Ruch and Del Manzo (89) report correlations between 
the will-temperament tests and an average of the Terman “Gioup 
Test of Mental Ability” and the Morgan “Group Mental Test ” 
The average of these is 17 Meier (80) repoits a correlation 
of -1-21 between “total score” on the Downey test and on the 
Terman group test B^ant (55) finds correlations of 38 be- 
tween total will-temperament and Stanford-Binet IQ, and .43 
between will-temperament and Stanford-Binet mental age The 
evidence indicates that the Will-Tempcrament tests corielate 
slightly and positively with tests of intelligence. 

Summary of Downey WiU-Temperament Tests. A brief cate- 
gorical summary of the findings should be of service in helping 
to make a final estimate of the significance and value of the 
Downey Will-Tempeiament Tests 

J. Reliability. The tests vary m reliability from about 30 to 
over 80 The aveiage reliability is about 60 Some of the tests 
involving handwriting are highly reliable Others that involve 
less habitual reactions and in which the directions are not per- 
fectly standardized are less reliable 

2 Intercorrelations. The intercorielations of tests are very near 
zero. Even the correlations between tests bearing the same name 
are very low. The correlations of separate tests in the Downey 
series with composites of several tests supposed to measure the 
same functions range from 40 to 60 This would indicate that 
the functions measured by the Downey tests are specific and 
vary appreciably with each change in the situation or the task. 
There is grave doubt whether there actually exist general con- 
facts in the majority of persons that correspond to the test names 
assigned by Downey 

5. Correlations with latmgs. The correlations between separate 
Downey tests and ratings for these same traits in the individuals 
tested are very low. This may indicate one of several things: 
either the traits themselves do not exist, or the tests are poor 
measures of the traits whose names they bear, or the qualities 
are difficult to observe and rate in others. 

4. Groups separated by race. Social adjustments and the like 
have been reported to produce or go hand in hand with differences 
in the responses to the items. If these findings are substantiated, 
they will point to the value of the tests j'egardless of the low 
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interconelations and the low correlations with ratings Some of 
the differences leported weie mere chance differences and need 
be given no further attention Other cxpciiinenteis have demon- 
strated the diffeiences to be statistically significant But even in 
these cases we do not know how impoitant the differences really 
aie, and it may tuin out in a repetition of the experiment that 
chance exaggerated certain of tlie differences 

Conclusion. The general enthusiasm tTiat originally greeted the 
Downey Will-Temperament Tests has subsided Their practical 
value in guidance, personnel selection, and the like has not been 
dcmonstiated Expeiimental work shows them to be tests of very 
specific abilities rather than tests to diagnose the more general 
personal qualities the names of which they bear. 

Summary 

From a review of all of the skilful and ingenious methods for 
testing conduct directly that have been devised, the conclusion 
stands out above all others that conduct is very specific When 
exactly the same test is repeated, the correlation is fairly high, 
perhaps around 70 or .80. But when the situation is changed 
ever so slightly, the correlation between the two similar tests 
drops, and long before the two situations seem different enough 
to be called by different names, the correlation has dropped close 
to zero. A battery of tests designed to test such a trait as per- 
sistence, or aggressiveness, or speed of decision gives results so 
varying and with so little consistency as to furnish little war- 
rant for assuming the presence of such a trait. 

These low mtercorrelations also help to explain the low cor- 
relations of these tests with outside ciiteria Since the tests are 
so specific as to fail to correlate with tests bearing the same 
name, naturally they could not be expected to correlate to any 
degree with othei factors which aie admittedly dissimilar. 

The conclusions that one draws from these results are not 
very encouraging. There are four possible things that may be 
done with performance tests in the measurement of conduct. 

I. They may be discarded as bemg so specific as to be useless 
for all practical purposes. 

2 Tests may be devised that apply to the specific situation in 
which they will be .used. Since tests are so very specific, test 
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situations must be set up which approximate as closely as pos- 
sible the situations in office, iiidustiy, school, or institution where 
they will be used If aggiessivcncss in selling is what is wanted, 
then the test situation must be one involving selling 

5 Since no one test measuies a given quality adequately, a 
variety of tests repiesentmg a range of situations in which the 
trait occurs may be devised so that the composite will be a 
satisfactory measure of the trait in question. This is what May 
and Hartshorne have done with the tests of deceit as well as 
with their later tests of helpfulness, inhibition, and peisistence. 
Thirty-two tests of deceit combined into one composite should 
give a fairly representative measure of honesty But honesty thus 
conceived is an abstiact, composite affair. It, in turn, has little 
relationship to the specific act in the specific occasion for cheat- 
ing. To know a person’s honesty in gcneial is not of high value 
in knowing how he will react in a specific situation 

We might follow this reasoning out to a more general con- 
clusion The measurement of character is a relatively useless 
and theoretical interest. Character is so very general as to give 
little inkling of w'hat to expect m a specific situation In other 
words, to select with confidence an honest bank clerk, we should 
test a candidate for honesty in handling money as a teller behind 
the grill in a bank To know his honesty in any other situation 
or even to know his honesty in general is to know little about 
his honesty in the particular situation 

How''ever, if it should happen that the more desirable the quali- 
ties a person possesses, the more integrated his conduct becomes, 
then tests would become valuable in selecting individuals It 
may be found that to find a man honest or persistent or assertive 
in any one situation is high presumption that he will be found 
persistent or assertive in other situations But the converse is 
not true, and the man who is deceitful, wdio gives up easily or is 
timid in one situation, will not necessarily exhibit these same 
characteristics in other situations. The criminal is one who usually 
has very strongly set habits which are specific in nature, but 
the good man is likely to be one whose conduct shows a higher 
degree of consistency. 

4. A fourth method of using these tests is to pick out one test 
for each of a number of different traits and weigh these tests 
in combination in order best to predict success in business or 



354 Diagnosing Personality and Conduct 

school. Since the tests have fair reliability but low intercorre- 
lations among themselves, it is very probable that a wisely se- 
lected gioup of tests weighted by the regiession equation 
technique could be found to piedict differences in delinquency 
or other forms of adjustment with a considerable degree of 
success. 

In all of these alternative methods of using performance tests 
certain practical problems of cost, difficulty of administration, 
difficulty in applying statistical techniques, and the like, arise 
which very definitely limit the use of these tests It is often 
expensive to test in the practical situation Certain ingenious 
devices must be applied which eat into time and money, as May 
and Haitshorne found out. Again, to give a well-rounded battery 
of tests is also expensive Finally, one who plans to use the re- 
gression equation technique must count the cost beforehand 
Performance tests have a real and valuable place at the present 
time in experimental work As used by Hartshorne and May they 
have revealed facts that were obtainable in no other way But 
there must be consideiable further development before tests of 
this type become a feasible tool m clinical work. 
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Chapter X 

THE FREE ASSOCIATION METHOD 


T he free association method stands apart as unique among 
experimental techniques and as one of the most potent 
tools for the diagnosis of conduct. In essence the method 
consists of presenting the subject with a list of words, to each 
of which he is requested to respond by saying the first word 
which occurs to him This makes a tiue measure of conduct be- 
cause the response is perfectly free, except as it is required that 
the first word which is aroused by the stimulus word shall be 
given. 

Sir Francis Galton, pioneer in so many matters psychological, 
was the first to make use of this method in a systematic way 
(1879) Wundt soon after experimented with the method m his 
laboratory Beginning with Krapelin in 1896, continuing through 
Aschaffenburg and culminating with the work of Jung and Riklin 
in 1906 came the noteworthy applications of the association 
method to the diagnosis of “complexes” or centers of emotional 
irritation. Another line of development was started by Munster- 
berg in 1889 through his use of the association method m the 
detection of guilt and dissimulation In 1910 Kent and Rosanoff 
published their study of the use of the free association method 
in the detection of insanity. Since the method must be used 
individually and partly also because the exploitation of intelli- 
gence tests has engrossed psychological investigators, interest in 
it has recently somewhat died down, but m view of the potential 
fertility of the method, it is due for a revival. 

As is so often the case, some of the sagest observations upon 
the method were made by the earliest workers Galton (16, 17) 
observed, for instance, that free association is never entirely free. 
To a given word there is not an equal probability that any one 
of the many thousand other words in the English language will 
be given Indeed, Galton discovered that for any individual the 
range of responses to a given woid is decidedly small, being only 
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a few words at the most He also found, m his own case, that 
many of the associations were of long standing, having been 
formed in boyhood or early youth Out of the myriad associations 
in which each woid has appeared, the few to occupy an abiding 
place m the memory must receive many repetitions or have con- 
tinued living interest These early observations have been re- 
peatedly confirmed in the later investigations To quote one. Miss 
Furst (34, ch XI), a pupil of Jung, studied the degree to which 
associations tend to become constellated in families Of this woik 
Jung says (30, p 245). 

“One might indeed think that in the experiment, where full 
scope IS given to chance, individuality would become a factor of 
the utmost importance, and that therefore we might expect a very 
great diversity and lawlessness of associations But as we see the 
opposite IS the case. Thus the daughter lives constantly in the 
same circle of ideas as her mother, not only in her thought but 
in her form of expression, indeed, she even uses the same words. 


Table 7S 

Kent-Rosanoff List of Free-Association Words 
(37. PP 37-38) 


I 

table 

26 

wish 

2 

dark 

27 

river 

3 

music 

28 

white 

4 

sickness 

29 

beautiful 

5 

man 

30 

window 

6 

deep 

31 

rough 

7 

soft 

32 

citizen 

8 

eating 

33 

foot 

9 

mountain 

34 

spider 

10 

house 

3S 

needle 

II 

black 

36 

red 

12 

mutton 

37 

sleep 

13 

comfort 

38 

anger 

14 

hand 

39 

carpet 

IS 

short 

40 

girl 

16 

fruit 

41 

high 

17 

butterfly 

42 

working 

18 

smooth 

43 

sour 

19 

command 

44 

earth 

20 

chair 

4S 

trouble 

21 

sweet 

46 

soldier 

22 

whistle 

47 

cabbage 

23 

woman 

48 

hard 

24 

cold 

49 

eagle 

2S 

slow 

SO. 

stemseb 


51 

stem 

76 

bitter 

52 

lamp 

77 

hammer 

S 3 

dream 

78 

thirsty 

S 4 

yellow 

79 

city 

SS 

bread 

80 

square 

S6 

justice 

81 

butter 

S 7 

boy 

82 

doctor 

S8 

light 

83 

loud 

S 9 

health 

84 

thief 

60 

bible 

8S 

lion 

61 

memory 

86 

joy 

62 

sheep 

87 

bed 

63 

bath 

88 

heavy 

64 

cottage 

89 

tobacco 

6s 

swift 

90 

baby 

66 

blue 

91 

moon 

67 

hungry 

92 

scissors 

68 

priest 

93 

quiet 

69 

ocean 

94 

green 

70 

head 

9 S 

salt 

71 

stove 

96 

street 

72 

long 

97 

king 

73 

religion 

98 

cheese 

74 

whiskey 

99 

blossom 

75 

child 

jpo, afraid 
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Table 76 

JuMC List (Modified by Eder) of Free-Association Words 


(34. P vii) 


1 

head 

26 

blue 

2 

green 

27 

lamp 

3 

water 

28 

carry 

4 

sing 

29 

bread 

5 

dead 

30 

rich 

6 

long 

31 

tree 

7 

ship 

32 

jump 

8 

make 

33 

pity 

9 

woman 

34 

yellow 

10 

friendly 

35 

street 

II 

bake 

36 

bury 

12 

ask 

37 

salt 

13 

cold 

38 

new 

14 

stalk 

39 

habit 

IS 

dance 

40 

pray 

16 

village 

41 

money 

17 

pond 

42 

silly 

18 

sick 

43 

book 

19 

pride 

44 

despise 

20 

bring 

4 S 

finger 

21 

ink 

46 

jolly 

22 

angry 

47 

bird 

23 

needle 

48 

walk 

24 

swim 

49 

paper 

2S 

go 

SO 

wicked 


SI 

frog 

76 

wait 

S2 

try 

77 

cow 

S 3 

hunger 

78 

name 

S 4 

white 

. 79 

luck 

SS 

child 

80 

say 

s6 

speak 

81 

table 

S 7 

pencil 

82 

naughty 

S8 

sad 

83 

brother 

59 

plum 

84 

afraid 

60 

marry 

8S 

love 

61 

home 

86 

chair 

62 

nasty 

87 

worry 

63 

glass 

88 

kiss 

64 

flight 

89 

bride 

6s 

wool 

90 

clean 

66 

big 

91 

bag 

67 

carrot 

92 

choice 

68 

give 

93 

bed 

69 

doctor 

94 

pleased 

70 

frosty 

95 

happy 

71 

flower 

96 shut 

72 

beat 

97 

wound 

73 

box 

98 

evil 

74 

old 

99 

door 

7 S 

family 

100 

insult 


What seems more flighty, more inconstant, and more lawless than 
a fancy, a rapidly passing thought^ It is not, however, lawless, 
and not free, but closely determined within the limits of the 
milieu.” 


In giving the free association test, 100 words are generally em- 
ployed. With fewer than 100 the results are not believed to be' 
reliable Some have stressed the point that only with a long list 
of words are the mode of reaction mechanized and the inhibitions 
reduced On the other hand, since fatigue sets in with a larger 
number, 100 seems to be a “happy medium ” Of the several lists 
extensively used, Jung’s has been carefully prepared to locate 
common complexes and the Kent-Rosanoff list has been made up 
“to avoid such words as are especially liable to call up personal 
experiences.” 

In making up a list, only those words should be selected which'* 
are commonly used and are presumably ijnderstood by the sub- 
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ject with the least extensive education Thorndike’s Word List * 
will be ot seivice heie Wuids of double meaning should not be 
'employed If the list is to be used for uncovering complexes, some 
' of the words included must be irrelevant, innocent, or matter-of- 
I fact words referring to things to which persons are normally well 
adjusted, while others may be relevant or critical words which 
are related to common complexes These should be arranged in a 
chance order with the exception that the first ten words or so, 
if innocent, will assist in mechanizing the mode of reaction In ' 
this connection it may be mentioned that Hull and LugofI (28) 
^found the words in Jung’s list were arranged in a regular alterna- 
\tion of stiong and weak Accordingly if the Jung list is used, it 
IS well to preface it by ten or twenty words for orientation pur- 
poses. 

' When the association experiment is given individually, there 
are two alternatives, one to present words visually for the sub- 
ject to read, the other to present them orally Since reading is a 
skill m which persons vary widely, it is found more satisfactory 
j on the whole to give the words orally 

Some genius ought to set himself to devise a method of giving 
the association experiment in groups. As things are now, if the 
purpose of the test is merely to obtain the associations, it can 
be given in groups by having tlie stimulus words dictated and 
letting subjects write their reaction words But since reaction time 
^ IS one of the most important evidences that the experiment af- 
I fords, much of the value of the method is lost when subjects 
merely write their responses. 

Elaborate devices have been used in order to record the time 
accurately Munsterberg (54) describes a set-up in which a little 
instrument is held between the lips of both the experimenter and 
the subject With such an instrument the least movement of 
speaking makes or breaks an electric current passing through an 
electric clock-work whose index moves around a dial every sec- 
ond When the experimenter moves his lips to give a word, he 
starts the pointer revolving. When the subject opens his lips to 
respond, the current Is broken and the pointer stops The associa- 
tion time can thus be measured to the thousandth part of a 
second 

■“Thorndike, E L, The Teacher’s Word Book, 2d ed , Teachers College 
Buicau of Publications (19^1) 
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Dunlap (ii) has investigated the relative merits of the chrono- 
scope and stop-watch in the measuiement of icaction time Al- 
though the stop-watch is relatively inaccurate and gives longer 
readings than the chronoscope (due to the lag in shutting off the 
watch in response to the subject's voice), it has some advantages 
over complicated apparatus which distracts the attention of the 
subject Those nho have consideied fairly the merits of the two 
methods prefer the stop-watch 

Jung (34, p 228) maintains that it is accurate enough to use a 
stop-watch registeiing fifths of a second, to be manipulated by the 
experimenter Even though considerable error may result from 
the use of such a crude instrument, Jung believes that the error 
IS still well within the limits needed by the method in its present 
stage of refinement About five associations a minute can be taken 
and recorded using this method 

In giving the test, the instructions are to answer “as quickly as 
possible with the first word that comes to your mind” or “with 
the first thing this void makes you think of” The test should 
be given with the examiner and subject alone in a room free 
from distractions It is well to have the subject seated comfort- 
ably where he cannot see the examiner, so that he will not be 
disturbed by the timing and the record 

The experimentei should record everything possible — the re- 
sponse, the icaction time, the gcneial behavior, and all move- 
ments and expressions 

Wells,"* in describing a blank for recording the report, suggests 
that the following symbols be used to indicate peculiarities in the 
response. 

c Desires to change response word 

/ Fidgets 

t Interjections not intended as response words 

I Laughs 

m Failure to understand stimulus word 

V Repeats Stimulus word 

s Speaks phrases or sentences which may contain response 
word. 

w Gives more than one response word 

X-300 No response produced within one minute 

? Asks questions about stimulus or response 

•Wells, F L, Mental Tests in Cltmcal Practice (World Book Company, 
1927). D 21 1 
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It has been found useful to go through the list a second time 
for “reproduction” Ileie one determines whether the subject 
remembers or is willing to repeat the response he made first The 
instructions are ‘‘I am going thiough the list of words once more, 
and I want you to try to give me the same words you gave me 
before If you remember or think you remember, say it. If you 
don’t, say ‘No.’ You can take as long a time as you want to 
answer.” 

Some writers, especially those of the psychoanalytic school, also 
require the subject to give the tram of associations which each 
word arouses They not only wish to have the subject respond 
^with a word, but they wish to get at what is back of the associa- 
tion. 

Before the method is discussed critically, mention should be 
made of the utter worthlessness from the experimental point of 
view of much of the work done on the association method Work- 
ers in the psychoanalytic field, usually practitioners with the 
clinical point of view, are the victims of their enthusiasm Their 
work has made them give close attention to individual cases, in 
which symptoms achieve a magnified importance Their “experi- 
ence,” with all of Its biases, overemphases, and neglects is placed 
before the results of strict scientific inquiry Mere contiguous as- 
sociations are elevated to the level of universal laws In conse- 
quence of this, many of the facts reported concerning the asso- 
ciation method in the following pages may later be overthrown 
when given more exhaustive experimental tests. 

The greatest deficiency in this experimental work is the small 
number of cases deemed to be sufficient in drawing important 
conclusions, a deficiency to some extent pardonable, since each 
test must be given individually and the method is time-consum- 
ing. Jung actually draws important conclusions fiom his experi- 
ences with a single case Much of the work has been done with 
fewer than ten cases This might not be serious if the results were 
entirely consistent among the few cases tried, but they are not. 
Many and many a time in psychological experimentation a dif- 
ference found with a few cases has dwindled down to practically 
nothing as their number has been increased. 

A second inadequacy of much of the work done arises from a 
need for special insight and skill on the part of the experimenter, 
which IS often unmet. A diagnostic method may properly require 
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skill on the part of the observer, but ought not to demand a 
sagacity not adapted to thoiough description m a manual When 
a method requires judgment, successful results from its use tend 
to become matters of choice — it becomes an art rather than a 
science 

A third error, not so much in the method itself as in the con- 
clusions drawn from it, is connected with the matter of types.^ 
One of the easiest pitfalls into which the psychologist falls is that 
of dividing people into types This error results from a failure to 
comprehend the principles of individual differences and the con- 
tinuity of these differences Those who are accustomed to working 
with large numbers of cases and with finely divided scales of 
measurement learn that whenever human traits are graded along 
a scale representing different amounts or degrees of the trait, the 
distribution practically always assumes the shape of the normal 
probability curve But the climcal worker who sees only a few 
cases IS unable to bridge gaps in the scale and falls into the easy 
explanation of types 

Wells (78), for example, In a certain experiment, divides his 
subjects into two groups, A and B, the former consisting, according 
to ratings, of persons who are intelligent, prompt, persistent, ac- 
tive, conscientious, truthful, deep of mood, sensible, and taciturn, 
while those in group B are intelligent, prompt, active, self-appre- 
ciative, loquacious, superficial in mood, and low in truthfulness, 
conscientiousness, and sensibility He fails to note that there is 
no clear dividing line between these two groups, and also that 
there is a far from perfect correlation between the different traits 
which define a given group. 

Perhaps the clearest case of the error of assuming the exist- 
ence of types was made by Marston In early work it had been'’ 
found that deception was a state characterized by long reaction 
time Marston (46) in later experimental work with only ten 
subjects found that in giving deceptive responses four had longer 
reaction times than normal, three had shorter reaction times than 
normal, and three showed no marked divergence from the normal. 
He concludes that there is a “positive” and “negative” type, quite 
overlooking the fact that individual differences may cause consid- 
erable divergence above and below the mean of the group, a mean 
probably slightly higher for subjects showing deception than for 
subjects working without deception. • 
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Classification of Response 

A large amount of work has been done m the classification of 
the responses m free association. Since answers are neither right 
nor wiong, and vary in many ways, some sort of grouping was 
found necessary before the replies could be mtei preted, and 
though most of those classifications have proved sterile and will 
not be reviewed, there are some which show significant diffciences 
in personality, and others which help m interpreting differences in 
reaction time to be mentioned later. 

Galton’s (16) original classification is of interest. He divides 
all associations into three groups, (a) those that are predominputly 
imagery, (b) those that are hutnonic and in which tendencies to 
react are uppermost, and (c) the abstract Galton believed that 
the histrionic associations were most frequent with him But indi- 
viduals differ in this respect Rusk (64) was amazed by the “as- 
tounding definiteness and vividness of children’s imagery ” 

Another famous classification is that furnished by Jung (34, 
p. 38) in woiking over the previous classification of Krapelin- 
Aschaffenburg The primary classification is’ 

A. Intrinsic (mnei) associations 
B Extrinsic (outer) associations 
C Clang (sound) associations 
D Miscellaneous associations 

In the group of intrinsic associations belong those which come 
by way of the meanings of words One suggests another because of 
some common element in the meaning. For instance, orange sug- 
gests apple, both being fruit, or water suggests wet, a quality of 
water In the gioup of extrinsic associations are those of contigu- 
ity, of mere contacts in space or time. Pen — ink is an example, or 
bitter — sweet In the thud group come those cruder associations 
of sound such as rime, or some even more elementary sound re- 
semblance such as make — shake or window — winter. Into the 
fourth group are thrown the miscellaneous associations, chief 
among which are the mediate or indirect associations in which a 
middle term is necessary to complete the link. 

This classification, expanded, is the basis of the following 
scheme taken from Jones’s Papers on Psychoanalysis ( 29, p 
431E 



The Free Association Method 


369 


A Intrinsic Association — Continuity. An essential resemblance 
present between the meanings of the stimulus and reactions 
words 


I Coordination Essential similarity 
between the two 

II Predication The reaction-word 
expresses some predicate, judg- 
ment, function, or attribute of 
the stimulus-word. 

An important sub-group is the 
defining or explaining association 

III Causal dependence The idea of 
causation implied in the response 


Apple — pear 


Snake — poisonous 
Book — something 
to read 

Pam — tears 


B 


Extrinsic Association — “Contiguity.” The resemblance present 
IS a superficial or ‘chance’ one. 


I Coexistence Simultaneous The 
two ideas connected through fre- 
quent simultaneous use 

II Identity. Synonyms or nearly so 

III Motor-speech forms The two 
words connected through fre- 
quent use in daily expression, 
proverbs, qualities. 


Pen — ink 
Effect — result 

Pen — Sword 
Caj — mouse 


C 


Sound Association The resemblance between the two words 
being primarily an auditory one. 


I Word completions 

II Clang 
III. Rime 


One — wonder 
Line — lying 
Cast — ^past 


D Miscellaneous. 


I 


II 


Mediate An indirect association, 
intelligible only on the assump- 
tion of an intermediary bond that 
does not appear in the reaction 
The association of the bond may 
be any one of the forms men- 
tioned above, and its relation to 
the stimulus word (centripetal) 
and to the reaction word (centri- 
fugal) can be separately classi- 
fied 

Senseless. No discernible connec- 
tion between the two words, in 
this case the reaction-word us- 


Run — rifle (centri- 
petal sound dis- 
junction, gun being 
the intermediary 
word) 
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ually refeis to some object In the 
immediate eiiviionment 

III Failure No leaction at all 

IV Repetition of the stimulus word 

Still another classification for which Jung (34, p. 168) is re- 
sponsible IS as follows* 

A Objective. 

I. Reaction principally conditioned 

via the objective meaning of the 
stimulus word 

II. Reaction principally conditioned 

via the linguistic features of the 
stimulus word. 

B. Subjective Egocentric. 

I. Predicate. 

a. Personal judgments Emo- 
tional Noble — man 

b Definition Intellectual Car — a vehicle for 

II. Constellation transportation 

a Simple constellation 
b Complex constellation 

A word of explanation for the two latter groups By a simple 
constellation association is meant a reaction influenced by special 
-^individual complexes strongly invested with emotion. In the 
complex type the influences aie “unconscious,” that is “the person 
is not aware of their content, which, being too unpleasant to 
remember, has been buried ” 

Still another classification scheme is that of Kent-Rosanoff (37) 

I. Common reactions 
I. Specific reactions 
2 Non-specific reactions 
II Doubtful reactions 

III. Individual reactions 

1. Normal reactions 

2. Pathological reactions 

A. Derivatives of stimulus words 

B. Partial dissociation 

(a) Non-specific reactions 

(b) Sound reactions 

i. Words 

ii. Neologisms 
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(c) Word complements 

(d) Particles of speech 
C. Complete dissociation 

(a) Perseveration 

(b) Neologisms without sound relation 
3. Unclassified 

/By a common reaction these writers mean a response that is 
given by laige numbers of people, by an individual reaction is 
meant a response given infrequently In an exhaustive study of 
the responses of 1,000 individuals, Kent and Rosanoff have drawn 
up tables of the frequency with which various responses occur, 
and these tables may be referied to for deciding whether a re- 
action IS common or individual 

These classifications aie of value only in helping one to inter-^ 
pret the significance of various responses, but to classify accu- 
rately a bundled responses is difficult if not impossible Certain 
of the classification categories overlap, others are vague, and to 
assign a response to others would require insight or skill beyond 
the reach of any experimenter, since the mere association simply 
IS not a guide to its own classification In short, such a classifica- 
tion IS at best extremely subjective. 

Factors Conditioning the Response 

Jung (34, p. 167) has shown that the type of reaction is a 
function of the quality of attention When attention is heightened, 
the association is meaningful, when attention is relaxed, inhibi- 
tions are broken down, and verbal and clang reactions are given. 
Jung holds to the somewhat fanciful explanation that these clang 
reactions are infantile and require to be inhibited in noimal pur- 
poseful activity In coiroboiation, Aschaffenburg (38, p 566) has 
found that under fatigue, associations become more superficial 
and the relation between the stimulus word and the reaction be- 
comes looser Wells (81) has shown that reactions become more 
superficial after practice, in effect also attributable to weariness 
or fatigue. Wells also finds that practice tends to differentiate and 
particularize the response A similar tendency toward superficiality 
of association has been noted as the result of the action of drugs, 
such as alcohol, tea, coffee, etc In short, as the attention is re- 
laxed. the quality of the association lowers.* 
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Jung (34, p 168) again tells us that the educated (intelligent’) 
also exhibit this distraction or dissociation phenomenon, and that 
their associations are more shallow than those of the uneducated 
He explains this (in pait) on the ground that the uneducated per- 
son lets himself go much less and allows fewer disguised subjec- 
tive wishes and valuations to break through (p 130) a psychoan- 
alytic explanation which seems the longest W'ay round A much 
simpler and more direct explanation given bv Jung is that the un- 
educated man icsponds to words as fragments of sentences and ac- 
cording to the expeiience aroused Woids are \icegerents of the 
situations they represent The uneducated person responds to a 
word subjectively', personally', as though he w'ere being asked a 
question The educated person, on the contrary', is used to consid- 
ering words by themselves, and is therefore ready to give a merely' 
verbal association. 

Allport (2) finds few'er egocentiic reactions when one is w'ork- 
ing in a group than when w'orking alone Appaiently the group 
stimulates “group” reactions, while inhibiting reactions that are 
strictly personal or egocentric 

Woodrow and Lowell (88) show' that children show fewer in- 
dividual reactions than adults, using tables of children’s frequency 
of response This contradicts findings by' RosanofT and Rosanoff 
(63), who interpreted the children’s reactions on the basis of their 
adult tables. A simple interpretation of the findings of Woodrow 
and Lowell is that children have a smaller stock of words on which 
to draw and fewer peculiar or individual experiences They have 
not yet, as a rule, met the variety which results from the vicissi- 
tudes of livmg Some of the earlier writers w'ho believed that 
children gave a larger number of individual responses tried to 
explain this by' asserting that children w'ere more proof against 
“partial dissociation ” This is only another illustration of the ease 
with which imagination tends to run ahead of the facts m this 
field. 

There seems to be a difference in testimony, also, as to the 
rehabdity of these classifications. Rosanoff, Martin, and Rosanoff 
(62) report that there is great variability when the test is re- 
peated after an interval of three minutes. On the other hand, 
Guthrie reports a reliability coefficient of 80 (one half against 
the other) when the free association test is scored by the number 
of “common” (Kent-Rosanoff) responses. _ - 
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Some of the major factors conditioning the response have been 
mentioned, including variations m the attention, education, and 
age of the subject Other major factors undoubtedly exist, but in 
the main one must look to the individual experience for an ex- 
planation of the piecise word given 


Reaction Time and Factors Related to It 

Many investigators believe that the reaction time is more signifi- 
cant than the actual reaction itself By reaction time is meant the 
interval between the instant the stimulus word is presented and the 
instant the response word is given by the subject. 

1. The first factor to be considered is that each individual has 
a reaction time that is peculiar to himself Differences in re- 
action time weie investigated by Cattell (6) long ago, and the 
facts are well known to-day On the other hand, it would be cum- 
bersome to have to obtain an individual’s reaction time tendency 
befoie using him in the association experiment. In practice it is 
common to assume a median reaction time of two seconds (Jung, 
34, p. 231, gives I 8 seconds) measuiing deviations from the nor- 
mal from this figure Assuming that individuals vary in their 
normal reaction time around this median value, let us inquire 
what other factors influence reaction time 

2. It has been noted that “free” association has a longer re- 
action time than “contsollcd” association Ask a person to give 
the opposite of hot, and the response will come more quickly than 
if you give him the word hot and ask him to tell you the first 
word he thinks of When the additional control “opposite” is 
given to the subject, the range of altcinalives is vastly decreased, 
but when one is given freedom to utter any word whatever, a time- 
consuming choice must be made from many possibilities. If for 
any reason the choice is difficult, or if the first impulse to speak 
a word is blocked, the reaction time may be considerably length- 
ened It is this that makes reaction time so serviceable in the 
diagnosis of complexes. 

j. Crane (9) finds that when the word is presented visually, 
the reaction time is longer than when it is presented orally. 

4. Woodworth-Wells (87) found that the reaction time to the 
first word in a list is considerably slower than to subsequent words 
in the list or to words given in isolation. * 
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.pjjig (34, p 234 ff ) finds that the type of stimulus word has 
the reaction time The shortest reaction time followed 
the pr«entation of concrete words (167 seconds), the longest 
reaction time followed abstract words (i 95), while verbs occupied 
an intermediate position (19) Somewhat the same thing was 
found by Crane (9), who reports that adjectives yield the short- 
est times, nouns next longest, and veibs the longest of all Prob- 
ably there are three factors at work here One is that we are 
more familiar with concrete than abstract words, and therefore 
the associations are more plentiful and more ready A second fac- 
tor IS that adjectives and nouns usually require nouns and verbs 
respectively for completion, whereas verbs do not regularly re- 
quire woids for completion of the thought Hence it is lelatively 
more difficult to obtain responses to verbs Still a third may be 
that the response to nouns and adjectives tends to be objective 
and impersonal on the whole, whereas the response to verbs tends 
to be subjective 

6 Jung (34, p 236) points out that there aie also differences 
in reaction time according to the kind of response Names of ab- 
stract concepts require the longest time to produce (i 98 seconds), 
concrete words are next (r 81), while adjectives and veibs come 
most easily (i 65 and l 66) He also reports that the educated 
take the longest time in giving concrete words, but this conclusion 
must be accepted with reserve inasmuch as it is based on only a 
few cases 

7. With regard to the reaction times for the different logical 
categories used in classification schemes. Wells (84, p 22), after 
several pages of discussion, concludes that “In the Kent-Rosanoff 
experiments there was evident no general tendency of the moie 
frequent classes of associations to be shorter or longer than the 
less frequent ones ” 

This, perhaps, is because the Kent-Rosanoff classification 
scheme is not very significant. Jung (34, p. 238) reports that 
inner associations (according to the meanings) are longer than 
outer associations (the more superficial and verbal), while the 
clang associations have the longest reaction time of all. 

8 Allport (2) found that the speed of reaction was greater 
(reaction time less) when the subjects worked in groups than 
when they worked alone. 

p Wells (81), who 'has made an extensive study of the effect of 
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practice on free association, reports that reaction time decreases 
after practice 

10. Although some of the early investigators (Ziehen especially) 
reported that the speed of association increases with age, this is 
denied by Rusk (64), who finds no definite trend of this kind in 
his work 

11. Alcohol and caffeine cause a slight decrease in reaction time, 
according to Langfeld (41) 

•p/' 12. Jung (34, p 231) points out that men have a shorter reac- 
tion time (16 seconds) than women (2 9 seconds). This phenom- 
enon has been noticed by seveial observers and may be regarded 
as well founded But the explanations are many and diverse A 
common explanation is that women are more emotional than men. 
Wells reports in this connection that women are more variable 
than men m their reaction time. 

y^i3. Jung (34, p 232) again reports that educated (intelligent?) 
subjects show a shorter reaction time than uneducated subjects, 

14. Ekdahl (13), in a recent study, presents evidence to show 
that the reaction time is influenced by the presence of the investi- 
gator He found that when the association experiment was con- 
ducted by mechanical aids without the experimenter, the reaction 
time was lower than when the experimenter was present Part of 
this may have been a confusion or inhibition due to the presence 
of another person It is well known that a skilful experimenter by 
intonation can give words a more personal appeal, arouse com- 
plexes, and lengthen reaction time 

15. Words that carry unpleasant sense qualities lengthen reac- 
tion time, according to Tolman and I Johnson (75) and also W. 
Smith (67, p 249) On the other hand, Rusk finds that there are 
no significant differences in reaction time between pleasant and 
unpleasant words for children The fact that the affective tone of 
words influences reaction time seems pretty well attested for 
adults, Tolman and I Johnson find that women are moie prone 
than men to lengthen responses to unpleasant stimulus words and 
that, in the case of women, there is a decrease in reaction time 
for words pleasantly tinged, a fact which perhaps helps to explain 
the greater variability of women. 

There is a certain consistency in all of these findings that the 
reader must trace out for himself When it is considered that reac- 
tions are more often superficial when given by men than by 
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women, when given under fatigue, under drugs, when given by 
educated peisons, and after piactice, and that the reaction time is 
shorter for outer or superficial associations, after practice, under 
the influence of drugs, for men, and for the educated, it becomes 
clear that reaction time is not a matter of chance but is under the 
influence of one or more very definite forces Not all of these 
factors have been demonstrated with finality, since m some cases 
the experimental evidence is contradictory or confusing However, 
a general consistency apparent throughout the work makes it 
appear probable that future investigations will confiim most of 
the relationships noted above. 

It should be emphasized lepeatedly that none of these influences 
on reaction time is large, and that there is great overlapping For 
example, although we may say that men have shorter reaction 
times than women, this docs not mean that all men have shorter 
reaction times than all women Since there is much overlapping 
between the sexes, the difference in the averages is very small, and 
the biserial r between sex and reaction time is very small 
^ After all of these factors (each of which is small and distinct 
though on the whole relatively unimportant) has been taken into 
account, large differences in leaction time will be found left over 
Where the influence of the foregoing factors would be manifest in 
times of tenths of a second, reaction times fiequently occui which 
are several seconds in length It is believed that such lengthened 
reaction times are caused by a “complex” or emotional blocking, 
as when the woid or the situation which the word represents 
evokes an emotional response, such a host of ideas suige in that 
the subject finds it difficult to make any response In most of the 
cases the subject tends to hide or conceal his first impulse tc 
respond as being something too personal, too revealing, or too 
disagreeable 

The evidence that lengthened reaction times are due primaiily 
to the influence of emotion was derived originally fiom clinical 
investigations in which Jung and those who preceded him discov- 
ered that there was a relation between the words showing length- 
ened reaction time and the complexes of their patients More 
recently there has been striking confirmation of this through 
experiment. Eeterson and Jung (58), and W Smith (68, p 77) 
report that there is a marked correlation between galvanometnc 
deflections and reaction time in free association. As may be noted 
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from the chapter on “Physiological Measures,” the galvanometric 
experiment is a well-established method for detecting the presence 
of emotion Smith finds the correlation between the psychogalvanic 
reflex and word association leaction time to be 47 He believes 
the power of the woid association method is increased if it is used 
in conjunction with the psychogalvanic reflex As a measure of 
emotion the galvanic reflex is supeiior to reaction time In this 
connection reference should be made to the work of Landis, 
Gullette, and Jacobsen (39), who found that reaction time was 
one of the best criteria of emotionality available. 

Another bit of evidence is the inspection of the words them- 
selves that show long reaction time In most cases the emotional 
character of the words is clearly evident Jung finds the following 
words to give unusually long reaction times needle, hop, strange, 
false, heart, pyramid, strike, threaten, lemember, ripe, to woo, 
fern, hair, nauseate, dream, papei, book, harm, softly, caress, 
family, consciousness, freedom, faith, violence, wonder. Wells (84, 
p 13), using the Kent-Rosanoff series, finds the words yielding 
the longest reaction times to be. square, command, anger, religion, 
moon, slow, health, justice, stomach, wish W Smith’s (68, p 7S) 
list of words yielding long reaction times is name, friend, despise, 
make, sell, pioud, home, nasty, mairy, habit, pity, happy, angry, 
bring, dance, worry, kiss, brother, family. 

^=^W. Smith (68) siso tried out the reproduction of response words 
and the memory for them after a considerable lapse of time He 
finds that words with high affectivity aie best remembered, but 
that words soonest forgotten have an appreciably higher affective 
value than words which are moderately well remembered From 
this he infers that there are two distinct varieties of affective tone 
which he calls positive and negative, which correspond somewhat 
to pleasant and unpleasant He finds that prolongation of reaction 
time IS, in general, a sign of negative or unpleasant affective tone. 

In this connection Smith (68, ch. V) believes that the word j 
association method may be used for determining the integration j 
'■'of personality. He finds, for instance, that the reaction time for the 
list of words for a given individual tends to correlate 65 with a ' 
repetition of the list for that individual The average correlation I 
of two individuals is about -f- 15 If an individual is consistent in | 
his responses, his personality is highly unified, whereas if he is 
inconsistent there is evidence of unstability or dissociation. d 
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Wells (81) points out that the value of the association experi- 
ment for measuring the affectivity of the subject decreases with 
practice 

In summary the evidence is conclusive that the reaction time of 
word association is a valuable measure of the affectivity of a 
subject in gencial and to certain situations Although the word 
association reaction time is first a function of the reaction time of 
the individual and second is influenced to a slight degree by a 
variety of miscellaneous factors such as sex, nature of the stim- 
ulus, nature of the response, age, education, and setting of the 
experiment, large variations in reaction time are in great measure 
due to the presence of emotion. 

Complex Signs 

Now that the value of lengthened reaction time has been dis- 
< cussed as an indication of the presence of emotion or a “complex,” 
let us turn to the general question of the value of the asso- 
ciation expeiiment for uncoveimg situations that arouse emo- 
tional responses This has been given considerable study by the 
earlier investigators, who have enumerated a long list of “signs ” 
Whereas the earlier investigators discarded any word from the 
list to which the response was irregular, the later ones found 
that these irregularities and aberrations of response were ex- 
tremely significant Some of the more important complex signs are 
as follows 

I Long reaction time This is the only quantitative measure 
available. Any reaction requiring over 2 6 of a second is usually 
considered significant 

2. Inability to make any response whatever Occasionally no 
response will be elicited even though time up to a minute is 
allowed Such failure to respond may be due to a number of fac- 
tors, among which may be mentioned inhibition of any response, 
articulatory block, attention diverted by copious or diverting 
.imagery, absorption in trains of imagery or reverie, competition 
^ of reaction words , or no meaning found in the stimulus. 

J. Extremely shoit reaction time. 

4 Repetition of the stimulus word itself. 

5 Apparent misunderstanding of the stimulus word The 
psychoanalytic explanation is that in such cases there is a strong 
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desire not to understand But this explanation need not be as- 
sumed In some cases it may well be true that there was a definite 
misunderstanding due to faulty learning, or indistinct or strange 
pronunciation on the part of the examiner Perhaps in all such 
cases the prepotency of any part of the word is also influenced by 
competing ideas or images perseverating from previous associa- 
tions. 

6 Defective reproduction of original reaction at second pre- 
sentation of the stimulus word In the reproduction experiment, if 
the second response differs from the first, suspicion of a source of 
irritation arises Smith (68, p. 78) believes this to be the best 
complex indicator.* 

7. Response with the same reaction word to two or more dif- 
ferent stimulus words This is sometimes called perseveration 
Perseveration may be due to a certain complex or constellation 
which dominates consciousness, or to a poverty of ideas or to other 
more significant causes In some cases where the subject suspects 
the nature of the experiment he may avail himself of perseveration 
to assist in concealment. 

8 . Strange or apparently senseless reaction. 

g. Perseveration of ideas. In this case, though the exact word 
may not be repeated as in number 7, the same idea perseverates 
in two more responses. 

A variety of other complex signs have been noted, especially 
peculiarities in the response and uneasiness m the behavior of the 
subject. Even when the response itself seems to have peculiar 
significance, unless the experimenter is confident of his judgment 
he should be alert to prevent his imagination from imparting sig- 
nificance where it does not really exist Suspicion of emotional 
irritation may be aroused by such mannerisms as whispering the 
response, bodily movements, nervous movements of the hands, 
reddening of the face, coughing, clearing the throat. In all proba- 
bility many valid physiological indices could be determined by 
laboratory instruments such as changes in the pulse, blood-pres- 
sure, breathing,** knee-jerk, strength of grip, etc. Giving more 

• In this connection Peterson and Jung report that in correlating re- 
sponses in the reproduction experiment with the psychogalvanic reflex they 
found altered reproduction associated with increased deflection of the galvano- 
metric needle 

** Nunberg, who has studied variations in breathing under emotion, believes 
that “unconscious” complexes show strong inhibition of respiration, whereas 
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than one word in response is a very suspicious sign * KoAs (38) 
lists such types of response as quotations, titles, sentences, addi- 
tion of the article, example {the petson), or responses in a foreign 
language, as being significant A familiar subterfuge which may 
betiay the pieseiice of a complex is the naming of some article in 
the examiner’s room. 

Thanks to the work of Hull and Lugoff (28), we have an experi- 
mental check on several of these “signs” as indicators of a com- 
plex. They obtained responses to a list of 100 words from fifty 
men and fifty women ** The occurrence of the first nine of the 
complex signs listed above was noticed Correlations weie com- 
puted between each of the “signs” and also between each sign and 
a composite of all signs grouped together. These correlations are 
shown in the accompanying table 

Tadle 77 

Correlations between Complex Signs in Free Association 
(from Hull and LiigolT, 28 p 121) 
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V 
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20 15 

+ 24—31 

— 09 -f 02 

— 06 
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From this Hull and Lugoff conclude that (a) repetition of the 
stimulus word is decidedly the most reliable diagnostic sign of. 

with “conscious” complexes there is excitation in addition to the inhibition 
Nunberg, H , “liber Korperliche Begleiterscheinungen Assoziativer Vorganger,” 
/ / Psychol 11 Neurol, 16 102-128 (1910) 

•In this connection Jung (34. p 2oz) remarks that the feeble-minded sel- 
dom react with one word and those with low mentality have a tendency to 
give phrases or sentences rather than a smgle word Often the tendency is to 
give a definition of the stimulus word Accordingly one must not interpret the 
tendency to give more than one word as a symptom of the presence of a com- 
plex unless other signs give corroboration 

••No significant sex differences were found Certain words differed markedly 
in the number of signs yielded by men and women, but on the whole the mer 
and women showed about ihe same number of signs on the various words 
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the five indicators examined, (b) the first four given in the table 
are in all piobabiliLy ical complex signs, (c) repeated use of the 
same reaction woid is a complex Sign of very little value 

As to extremely short reaction times, Hull and Lugoff find that 
they are neutial 111 indicating the presence of a complex. 

Finally, Hull and Lugoff studied expeiimentally a point pre- 
viously emphasized in the literature that one should not depend on 
the occurrence of one of the signs for deciding the presence of a 
complex The eailier useis of the method concluded, “One may 
enter a judgment only when in possession of an aggregate of 
indicators ” Different individuals piobably betray their complexes 
in diffeient ways But Hull and Lugoff believe that the increase in 
diagnostic potency resulting fiom the addition of successive indi- 
cators follows a law of diminishing returns “While tw^o mdicatois 
aie distinctly more significant than one, the second indicator adds 
by no means as much diagnostic potency as the first ” 

One of the phenomena stiessed by Jung is the tendency for the 
emotional tone of a stimulated complex to persist in the next 
few responses, 1 e , to perseverate, so that the tendency to make 
a long reaction time to a certain word might last over the two or 
three following words which were in themselves innocuous It has 
been observed that a woid containing consideiablc affect may 
evoke a response having a normal reaction lime, but that the affect 
persists and may cause long reaction times in the next few woids 
For instance lose may immediately arouse the memory of clothes 
lost m a fire, and the response clothes may be given with no in- 
crease in reaction time But the train of thought aroused may 
persist and mterfeie with normal response to the next word or 
two. Hubbard (27) studied this experimentally and found that 
this observation had no justification in fact, that the perseveration 
tendency, if such there be, is too small to be of diagnostic sig- 
nificance. 

In the same experiment Hubbard discovered the important fact 
that the significance of a complex sign varied according to its 
position in the series of words. For instance, long reaction times 
are more significant at the beginning or end of the series be- 
cause reaction times normally tend to be longer m the middle 
of the series Again the number of individual responses (in the 
Kent-Rosanoff sense) increases as one advances further m the 


senes. 
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The Free Association Experiment as a Test of 
Lying and Guilt 

It was Munsterberg (54) who first studied the use of the free 
association experiment for the detection of lying and guilt In a 
senes of popular articles he gives a vivid picture of the possi- 
bilities of the method m catching and convicting criminals and 
predicts the use of it as an adjunct m the opeiation of the law. 
Though certain difficulties have prevented its practical application, 
the principle underlying it is secure, and it may confidently be 
said to await only the skilful technician to put it to legitimate and 
practical use 

A description of the method, as worked up into a clever and 
startling laboratory exercise, provides perhaps the clearest ex- 
position of Its nature As Munsterberg pointed out, a criminal 
who IS hiding something has to suppress reactions of an emotional 
nature This inhibition will lead to an increased reaction time to 
critical words presented for response On the other hand, a hard- 
ened criminal who has nothing to conceal shows no emotion and 
hence makes no betraying reactions in the association experiment. 

The following description of a class experiment in the detection 
of suppressed ideas is quoted from the Elementary Laboratory 
Course in Psychology by Langfeld and Allport (42, pp. 112, 113). 

“Two subjects are selected, and are given a sealed envelope con- 
taining instructions for the ciime The two subjects leave the room 
and decide between themselves which is to play the criminal The 
‘innocent’ subject waits in the hall or in an adjoining room until he 
IS called 

“The ‘criminal’ now opens the envelope and follows the instruc- 
tions given It IS important that the innocent person should know 
nothing about the crime, and that no one but the two subjects 
should know which one is guilty. The crime, which should be 
arranged beforehand by the experimenter, should preferably be 
one which aiouses a certain degree of ‘criminal consciousness ’ The 
following IS given as a model for the instruction in the envelope 

“ ‘Go into room — , and there you will find in the desk drawer 
a sealed letter addressed to one of the members of the class Open 
It, tearing the envelope as little as possible, and read the contents 
carefully Then replace the letter m the envelope and, with the 
aid of the mucilage you will find on the desk, reseal the flap of the 
envelope neatly so as to avoid detection. Then put the letter back 
in Its original place 113. the drawer. 
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“ When you are called into the class-room for the experiment, 
use all your ingenuity to conceal your guilty knowledge of the 
crime ’ 

“In a crime of this sort the letter should reveal some very per- 
sonal, though fictitious affair of the student to whom it was ad- 
dressed, such as financial difficulties or a pateinal reprimand for 
debts and extravagance, etc. 

“After the instructions have been carried out, the ‘criminal’ 
joins the ‘innocent’ subject and awaits summons from the ‘court.’ 

“In the meantime the expeiimentcr informs the class of the 
details of the crime. A list of fifty common words has been pre- 
pared beforehand by the experimenter Of these %vords twenty-five 
are closely related to the details of the crime Such words are 
called ‘crucial words ’ The crucial words should be distributed 
throughout the list, some occurring singly, others in groups of 
from three to five A partial list is given below as an illustration, 
supposing the crime to have been the one described (though 
actually a totally different crime must be prepared) • 


chair 

disk 

church 

hat 

black 

fork 

horse 

supper 

coffee 

man 

broom 

mucilage 

dinner 

pencil 

drawer 

keys 

letter 

money 

watch 

open 

expense 

window 

debts 

etc. 

envelope 

tree 


telephone 

school 



“One of the subjects is now called in and seated facing the class 
The experimenter speaks the first word of the list distinctly, and 
the subject replies as quickly as possible with the first word that 
comes into his mind ” 

A number of experiments in the detection of guilt have been 
reported in the literature In general the results show that with 
naive subjects unacquainted with the method, the success of detec- 
tion IS remarkable. For Instance, Leach and Washburn (44) tried 
out the method on twenty-six subj’ects with two judges, who 
studied the results independently and made only one error in the 
fifty-two judgments Langfeld (40) in an experiment on two sub- 
jects, one “guilty” and one “innocent,” reports that the innocent 
subject showed reaction times to the crucial words averaging 37 
of a second longer than those to the non-crucial words, while the 
guilty subject showed a difference in the same direction amount- 
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ing to 83 of a second The delay for the guilty subject was over 
twice that for the innocent subject. 

The question always arises whether m this experiment one 
should use reaction time as the sole criterion of guilt, or include 
also a qualitative analysis of the responses to the critical words 
The general consensus of opinion is that the qualitative analysis 
is so unreliable as to be practically worthless Often an innocent 
subject will give a word response that might be interpreted as 
showing evidence of crime merely because that response is a 
common one to that stimulus word. On the other hand, guilty sub- 
jects will frequently misinterpret a critical word and give an 
innocuous response Coupled with this is the tendency on the part 
of the examiner to let his imagination construct hypothetical 
relationships in no wise responsible for the reaction. The only safe 
course is to rely strictly on reaction time. 

Reaction time would yield overwhelming evidence were it not 
for the fact that in any innocent individual a given list of words 
will touch off “private” complexes not in the least connected with 
the particular event or “crime” being investigated This cannot be 
avoided But by the laws of chance there should be as many of 
these private complexes evoked by the crucial words as by the 
non-ciucial words, and for both guilty and innocent subjects. 
Accordingly the difference in direction in response between the 
ciucial and non-crucial words should not be affected by these 
extraneous influences, and the only change should be in the direc- 
tion of making the amount of difference smaller. 

The stumblingblock to the successful use of the method in 
detecting crime in the actual situation, over and above the objec- 
tion previously mentioned that some criminals are concealing 
nothing emotionally, is that with a knowledge of the method it 
is possible to conceal deception In experimental work with naive 
subjects the method has been uniformly successful, but with 
sophisticated subjects the results are by no means so unanimously 
successful. Steinberg reports that out of twenty-three tests with 
sophisticated subjects fourteen correct and nine incorrect judg- 
ments were made using the criterion of larger average reaction- 
time for crucial words, but that only nine correct and fourteen 
ircoirect judgments were made using as a criterion the differences 
in average deviation of reaction time of crucial and non-crucial 
words. 
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There are various devices that a sophisticated subject can em- 
ploy to conceal his deceit. He may give as his response some word 
suggested by a previous word in the list or some previous response 
of his own, or a list of cities or a list of names or objects in the 
examining room. Or the subject may have responses prepared in 
advance to words that he guesses will be used as crucial stimulus 
words Since crucial words are often names or objects at the scene 
of the crime, preparation of response words to these objects might 
easily be attended to ahead of time. There is some doubt, however, 
whether this subterfuge would be uniformly successful How- 
ever, such devices, although they might control the reaction time 
to crucial words, ought to become immediately evident to the 
examiner, who could see that the subject was dodging the require- 
ments of the experiment. 

Because of the possibility that a person conversant with the 
association method may be able to conceal his guilt, the method 
is not at the present time a sure-fire technique. In the hands of a 
skilled examiner it probably could always be used on a naive 
subject with successful results. It is probable, also, that a dever 
detective, willing to follow up cues, might make successful use of 
the association method even where the subject tried to conceal his 
acquaintance with the criminal situation But the method cannot 
at present be called entirely dependable m the detection of lying 
and guilt. 


The Association Method and Insanity 

To Kent and Rosanoff (37) belongs the credit for investigating 
the possibilities of the free association experiment in the detection 
of insanity. Their approach is somewhat different from that dis- 
cussed in the foregoing pages Considering insanity as idiosyn- 
crasy, these investigators worked on the assumption that mental 
abnormality could be discovered by noting the number of unusual 
associations given to a list of words. Accordingly their first task 
was to compile the frequency of association of 1,000 normal sub- 
jects for 100 words. These have been presented m a set of 
frequency tables, one for each stimulus word, showing number of 
times each response word is given by the 1,000 subjects A common 
reaction is defined as one to be found in the tables, and an indi- 
vidual reaction as one which is not found, m the tables, Apy 
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reaction word which, though not in the table fn its identical form, 
is a grammatical variant of a word found there, may be classed as 
doubtful. 

Using the same list on 247 insane subjects, these authors were 
able to draw up the following table. 

Table 78 

Percentage of Normal and Insane Subjects Showing Common, Doubtful, 
AND Individual Responses in Free Association 

Common Doubtful Indmidual 

reactions reactions reactions 

1,000 normal subjects . 91 7 % I S% 6 8 % 

247 insane subjects 70 7% 2 5% 26 85 o 

This corresponds to a coefficient of contingency of 26 (uncor- 
rected) or 48 (corrected), and indicates that the method has some 
diagnostic power. Mateer, who has used the Kent-Rosanoff list 
in clinical practice, suggests the following standards (47, p 289) 

“If a child gives more than 10 individual reactions on the Kent- 
Rosanoff association test and is more than eight years old, or if 
he gives more than 45 out of the most common reactions and is, 
in this latter instance, of normal level in intelligence, it may be 
taken as an indication of psychopathy. 

“When the analyzed Kent-Rosanoff associations test is studied 
for quality, the test may be counted psychopathic in its indica- 
tions if more than 10 reactions are found which are abnormal 
according to the Kent-Rosanoff definition of abnormality, or if 
they give that many indications of perseveration, automatism, 
sound association, repetition of the stimulus word, etc.” 

These workers went further and attempted to use the associa- 
tion method for diagnosing special types of insanity. First they 
drew up a scheme for the classification of responses (described on 
p. 370) and studied the differences in response made by different 
insane groups Their table, which is too bulky to include here, 
reveals several significant differences in type of response They 
find, for instance, that m dementia praecox there is a tendency to 
(a) give neologisms, particularly those of the senseless type, (b) 
to give unclassified reactions, largely of the incoherent type, and 
(c) toward stereotypy, manifested chiefly by abnormally frequent 
repetitions of the same reaction.* Other types of insanity do not 

•Peterson and Jung (58) report no appreciable lengthening of reaction time 
in dementia prsecoz, but since this conclusion is based on the observation of 
only two cases, it may be dismissed as unproved. 
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show such clear-cut distinctions^ The paronoiac cases are far from 
homogeneous in the types of responses which they give Epileptics 
show tendencies to repeat one word or another many times, and an 
abnoimally pronounced tendency to make use of non-specific 
reactions or particles of speech Cases of paresis when mild give 
normal reactions, when severe, theie is a tendency to persevera- 
tion In cases of manic-depressive insanity there is again no 
uniformity in type of response. 

One must confess that the work of Kent-Rosanoff, although 
representing an immense amount of industry, is decidedly faulty 
from the statistical point of view Like so many investigators with 
training in medical science, they have a tendency to magnify indi- 
vidual observations and to fail to note inconsistencies In the first 
place, as admitted by them, the method of classification of re- 
sponses has a large element of subjectivity, a factor which cannot 
be easily dismissed and which remains as a cause of error in the 
results. Again, the inconsistencies and overlapping of response 
between different groups make diagnosis by this method alone 
very dubious In the repoits are records from normal subjects 
who make many perseverations, or unclassified reactions, or sound 
reactions, or non-specific reactions, and give every evidence of 
being abnormal There is no sharp line of division between the 
sane and insane, but on the contiary a very gradual overlapping 
The method, therefore, is presumptive or indicative only and can- 
not in Its present stage safely be used alone in the diagnosis of 
insanity. 

Free Association as a Measure of Ability 

It is strange that the association method should have been 
considered as a measure of intelligence or of school achievement. 
Superficially there would seem to be no 1 elation between the type 
of reaction given by the intelligent and non-mtelligent, except that 
the intelligent person with a larger vocabulary ought to give a 
greater number of individual words That this is the case has 
already been mentioned m this chapter. Jung points out the simi- 
larity between the type of response given by educated persons 
and by those suffering from various degrees of dissociation There 
is probably a very different set of causes lying behind this simi- 
larity m type of response. 
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Various workers, however, have noticed qualitative differences 
in the free association response among the different grades of 
feeble-mindedness Wreschner (89) found that with the idiot the 
quality of the stimulus word has a great effect on the response 
The simpler the stimulus word, the higher the quality of the 
reaction word and the shorter the reaction time, any increase in 
difficulty or abstractness of the stimulus word is accompanied by 
a more superficial response and a longer reaction time Wehrlin 
(ch. Ill, 34) finds that imbeciles seldom react with a single word, 
but receive the stimulus word as a question to be responded to 
with an answer There is also a marked tendency to definition in 
the imbecile. He tries to explain the stimulus word instead of 
responding with the first word coming to his mind. Thus we find 
a shift from the characteristic response of a phrase or sentence that 
tries to fathom the meaning of the word among the feeble-minded 
up to the other extreme of mere verbal play on words, more or 
less superficial, among the highly intelligent 

Rosanoff and Rosanoff (63), who studied the responses of chil- 
dren to the Kent-Rosanoff list, report a definite correlation 
between association and mental capacity and also between asso- 
ciation and school grade However, their tables present differences 
in type of response between the bright and dull groups and also 
between pupils in higher and lower grades which are so slight as 
probably to be statistically insignificant This difference cannot be 
computed, however, since the numbers m each group are not 
reported. 

Eastman and Rosanoff (12) present an elaborate study of the 
use of the Kent-Rosanoff list on feeble-minded children in which 
they conclude “states of arrested mental development present 
certain fairly characteristic associational tendencies, characterized 
mainly by failures of reaction, non-specific reactions, and certain 
types of individual reactions ” But here again the statistical tech- 
nique employed fails to show the significance of the differences 
found or the amount of overlapping with normal children 

T L Kelley (36) has used the free association method with a 
technique of his own devising In an early work he had studied 
the correlations of the association method with class standings in 
mathematics, science, and foreign languages. In the later experi- 
ment referred to he applied the Wells logical categories for classifi- 
cation, which have never yielded very significant results. Since 
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there were only twelve subjects, the probable errors of the co- 
efficients of correlation were large, though some of the coefficients 
turned out to be large enough to be statistically significant But 
the relationships are difficult to interpret Altogether, the method 
is so cumbersome and difficult to use for the prediction of school 
achievement, and the results are so unpromising, that this work 
has never been followed up, and probably with reason. 

Free Association as a Measure of Interest 

Later in connection with the California Survey of Gifted Chil- 
dren, Kelley encouraged Mrs J. B Wyman (74) to use the asso- 
ciation method in studying the trUellectud, social, and activity 
'^interests of children The treatment of the data in this work is 
unique and deserves a brief description After some preliminary 
experimentation two lists of sixty words each were drawn up, the 
words being selected as “adapted to provoke responses due to 
intellectual interests, social interests, or activity interests ” Chil- 
dren in the sixth and seventh grades gave their associations as in 
a group test by writing down the first word which occurred to 
them. These children were rated in various ways by their teachers 
for intellectual, social and activity interests. Finally groups of from 
fifty-eight to seventy-one children were selected as possessing or 
not possessing the different kinds of interest in question, though 
it was recognized that the two extreme groups in each case were 
merely the ends of a continuous distribution of the interests in 
question. 

Then each separate response word was studied with respect to 
the frequency with which it was given by each of the contrasted 
interest groups (74, p 463). 

“A particular response word given by a large percentage of the 
‘with’ group and by a small percentage of the ‘without’ group (01 
vice versa) has a high differentiating value and is diagnostic. Foi 
each response, therefore, the frequency was found separately for 
the ‘with’ group and the ‘without’ group, in terms of the percent 
who gave it The difference between these two percents was also 
found. Next the S. D. of each percent was computed, and the 
S. D. of the difference between the two percents. Comparing the 
differences between the percents with the S D. of the difference 
gave a measure of the amount of a particular kind of interest 
involved in a particular response word. Accordingly, the score 
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assigned to a response word was the difference between the per- 
LLUts of ‘with’ and ‘without’ gioups giving it, divided by the S D 
of this difference The latios thus obtained were tiansmitted into 
a 0-20 scale, o indicating no interest and 20 maximum interest It 
was necessary, of couise, to cany through this procedure sepa- 
rately for the three kinds of interests In all, theie were upward of 
10,880 response woids to evaluate, each three times Upward of 
13,000 additional responses were encountered in scoring the papers 
of other groups of children tested.” 

Because many of the response words w^ere similar in meaning 
or were merely grammatical variations, and also because certain 
of the response woids tvere not given by one or the other group, 
making impossible the statistical treatment mentioned above, it 
was found necessary to fall back on certain arbitrary and sub- 
jective groupings in devising a scoring key This key for a single 
stimulus word fills an entire page Following is a brief section of 
the key for the stimulus word gem. 

Scores for Responses to Stimulus Word Gem 


Frequency 

Responses 

I 

S. 

A 

44 


4 

4 

8 

1 14 

diamond 

20 

11 

IS 

79 

stone 

12 

10 

IS 

57 

j’ewel(s) 

12 

13 

s 


etc , for 72 other responses 


This means that when a given paper is being scored for intellec- 
tual interest, and the response to the word gem is given as dia- 
mond, a credit of 20 is given The sum of the credits to the sixty 
words is a measure of intellectual interest Each paper must be 
scored again for activity interest and social interest. 

The method is sound statistically, but extremely laborious in 
the effort required to obtain the key in the first place and also to 
use the key to score a set of papers Laslett (43) has used the 
same technique in constructing a free association test of de- 
linquency, but It can be doubted that a method so very long and 
tedious will ever receive wide use 

Freyd (15) used the Kent-Rosanoff series in his study of the 
mechanically and socially inclined He found that there were 
certain significant differences between the groups, for instance, the 

* From Tcrman, Genetic Studies of Genius, Vol I, 2d ed (Stanford University 
Press, 1925), 463 Used bj*. permission 
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mechanically inclined gave more individual responses, while the 
socially inclined gave more common responses 

In conclusion it may be helpful to mention certain variations of 
the method, given by Kohs (38, pp 568-572) 

His first two methods consist of a discussion of the free associa- 
tion method and the reproduction (both of the response and the 
stimulus) method Since they have already been discussed, Kohs’ 
remarks on them will not be repeated here Concerning variations 
on the reproduction method, he says 

“In these experiments, the examiner assumes that the subject 
has had a specific, vivid, emotionally-toncd experience He is then 
read a story (‘Versuchsgeschichte’) so constructed by the experi- 
menter as to be veiy similar in content to the experience in ques- 
tion. The details in the nariative, however, are incomplete, vague, 
and only cleveily approximate to the actual events Having been 
instructed to pay strict attention, the subject is requested to recall 
as much of the story as possible But the details in the narrative 
and experience being so much alike, he more or less unconsciously 
confuses those of one with those of the other, and, as a result, 
inserts and supplements items which were wholly absent from the 
story By this intimate knowledge of details, he easily betrays his 
familiarity with certain events He may have previously denied 
most emphatically the least acquaintance with a single incident. 
Neverthdess, his responses definitely point to the contrary. 

Ill The 'Aussage’ or Reproduction Method {Wertheimer) 

“This procedure is not to be confused with Jung’s The indi- 
vidual heie IS asked to reproduce 01 ally the story which has been 
read to him Using the technical terms of the ‘Aussage’ Test, this 
portion of the method is the ‘Bench t’ or leport, and may be fur- 
ther supplemented or completed by the ‘Veihor’ (the interrogatory 
or deposition) The latter may be any one or a combination of 
these thiee types First, the questions may relate to details only, 
which are common to both story and ‘Tatbestand’ (experience, or 
constellation of facts) , second, the questions may yield different 
responses owing to differences in story and ‘Tatbestand’, third, 
the questions may relate to details present only in the ‘Tatbest- 
and ’ Kramer states that this method may work perfectly in the 
laboratory, but may fail absolutely in actual 'application More 
data are necessary 

IV. The ‘Aussage’-Association Method {Wertheimer) 

“Here the subject is given an association series made up of 
words of these three types, first, those relating to similarities in 
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both the ‘Tatbestand’ and the story, second, those relating Seither 
to the ‘Tatbestand’ nor the story, third, those relating only to the 
‘Tatbestand.’ The subject is requested to state his associations only 
to those words which appeared m the story; to all others he is to 
respond with the word ‘nothing’ According to Lipmann the guilty 
easily betray themselves 


V. The Combination Method {Lipmann and Wertheimer) 

“The following is Lipmann and Wertheimer’s adaptation of the 
Ebbmghaus ‘Kombinationsmethode ’ After the subject has heard 
the ‘Versuchsgeschichtc,’ a sheet is placed before hurl containing 
the text of the stoiy, but from which many details have been 
omitted Room is left in the ‘Kombinationstext’ for the proper 
information to be inserted This information (not having been 
given at all in the story) either must be supplied from the ‘Tat- 
bestand,’ or else the text is so arranged as to draw out more or 
less forcibly a complete description of some incident inaccurately 
mentioned in the narrative. 

“Apparatus is arranged so that whenever the subject writes, 
an electric circuit is closed It is opened the moment the pencil 
leaves the paper A kymograph is used to record these ‘makes’ 
and ‘breaks’ of current Long, significant pauses may thus be 
recorded and the correlations between length of pause and char- 
acter of the supplementation noted Attempts at dissimulation 
are easily discovered, first, by a too frequent response of ‘noth- 
ing’, second, by too many unmeaning reactions, third, by a 
greatly retarded reaction procedure At the end of the experi- 
ment, Lipmann and Wertheimer had their subj’ect give an intro- 
spective account of his mental activity during the examination 
Many of those who were found to have betiayed themselves 
were most positive in their insistence that they had said nothing 
which could in any way reveal the existence of the complex. 
Rittershaus sees no value in this method. 

“In all of the three above procedures, it is admitted that inno- 
cent as well as guilty individuals will make errors in their re- 
productions The point to be emphasized, however, is that the 
guilty make significant ones 

VI Perception Method {Wertheimer and Klein) 

“Optic stimuli are presented by means of a tachistoscope The 
exposure is made for a very short time The association series 
includes (a) purely irrelevant words, (b) complex-words, (c) 
irrelevant words similar by sight or sound to those of a complex 
type, (d) incomplete words which might be interpreted by the 
subject either as irroievant or complex, e. g. CH--E may be in-< 
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terpreted in terms of the complex as CHOKE, or may be under- 
stood by the innocent individual as CHORE or CHOSE 

“Results seem to indicate that (a) so-called guilty individuals 
perceive complex stimuli more quickly than irrelevant, (b) irrel- 
evant stimuli are falsely perceived as complex, (c) incomplete 
stimuli aie inteipreted moie easily as complex than irrelevant. 

“Instead of perception being visual, it may be auditory The 
authors suggest Gutzmann’s proceduie in the use of a phono- 
graph Thus the word coarse will be heard as coipse by the 
murderer. Some of the advantages in using a phonograph have 
been already indicated Besides the ones mentioned, the phono- 
graph adds obscurity and indistinctness to the stimulus, and 
eliminates any possibility of lip-reading 

VII Distraction Method {Wertheimer and Klein) 

“The subject is given two texts, the content of one being re- 
lated to the ‘Tatbestand,’ while the content of the other is quite 
irrelevant He is requested to put a line through all r’s as rapidly 
as possible The number of f’s crossed, the time, errors, and 
behavior are noted The results seem to show that the meaning 
of the text relating to the ‘Tatbestand’ is much more distracting 
to the rigid and accurate performance of the task. 

VIII, Sentence Method (Moravcsik) 

“The experimental material consists largely of sentences ex- 
pressive of different emotional states Sentences of indifferent af- 
fective toning are also employed The subject is requested to 
respond with whatever comes into his mind No attempt is made 
to narrow down the reaction to any paiticular form oi type, or 
to any number of words Moravcsik experimented only on melan- 
choliacs and maniacs The following are samples of his material 

Stimulus Words (depression) — sadness, pain, suffering, grave, 

sms 

(exaltation) — joy, riches, dance, laughter. 
(indifferent) — garden, house, snow 
Sentences (depression) — How unfortunate I am ' 

It would be well if I were buried. 
My heart is so full of pain 
(exaltation) — The best thing is a long life 
I am full of joy! 

(indifferent) — In winter the snow falls. 

fie observed that the melancholiac found a group of words ex- 
pressive of dejection harmonious and desirable, and revolting to 
those of an exalted type. The reverse was true of the maniac. 
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He also concludes that those possessing a strongei mental bal- 
ance answer with a single word rather than with a sentence 

IX The ‘Ausfrage’ Method (Marbe, Messei , Buhler and others; 
designated ‘Ausfrage’ by Wundt) 

“The procedure is that of the simple association experiment 
A stimulus word is presented, and the subject lesponds with the 
first idea coming to consciousness Immediately following this, 
however, he is asked to give a complete introspection of all his 
mental operations during the reaction This method is of value 
only rrhere trained inlrospectionists are acting as subjects Wundt 
opposed Its use as a psychological experiment, since it lacks the 
four necessary elements, but Kakise, on the contrary, makes a 
strong plea for its employment From one point of view this 
mode of procedure resembles greatly the usual psychoanalytic 
method 

“Since some of these methods have only in one or two cases 
been given a fair and accurate trial by unbiased yet expeit 
psychometiists, for the detection of crime, it is not surprising 
to find so large a number of workers in this field skeptical as 
to their value We certainly need more data But, granting their 
worthlessness for criminal psychology, they still are of gieat 
promise to experimentalists interested in the diagnosis of indi- 
vidual complexes and constellations As yet, the procedures arc 
crude and the technique undeveloped The methods are waiting 
for a master-hand at reconstruction and improvement ” 

Summary 

That the association method is a powerful tool in the diagnosis 
of conduct has been demonstrated without a doubt The eailier 
attempts at classification have been largely abortive But the 
use of reaction time, the reproduction experiment, certain repe- 
titions, and strange twists and misinterpretations of words are 
extremely diagnostic of centers of emotional tension Though 
the method has been proved to be effective in the discovery of 
guilt, certain difficulties in its use with sophisticated subj'ects has 
prevented its practical application The association method also 
IS helpful in the diagnosis of insanity and milder psychopathic 
states, although it cannot be used as the sole criterion. Certain 
interesting and suggestive variations of the free association 
method have been summarized by Kohs. 
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Chapter XI 

PHYSIOLOGICAL MEASURES OF THE EMOTIONS 

C onduct which is accompanied by emotion has special 
significance In certain emotions the body is piepared for 
immediate and intense muscular activity, through the 
functioning of the sympathetic nervous system All the forces of 
the body are adjusted to peimit vigorous combat or flight The 
heart pumps more vigorously to carry away the fatigue poisons, 
and breathing becomes deeper and faster to aid m cariying away 
the waste carbon dioxide products resulting from increased 
metabolism Blood is withdrawn from the digestive organs so 
that the muscles may have a gieatei supply, and the gastric secie- 
tions are retarded The pupils of the eye enlarge to permit the 
entrance of more light The sweat-glands secrete, thus acting as 
a temperature legulator against the increased heat to be pro- 
duced The liver releases sugar into the blood for an enlargement 
of immediately available energy, and adrenalin is secreted to 
facilitate the removal of fatigue products and to hasten blood- 
clotting if necessary 

A man in a baffling situation has various alternatives before 
him He may postpone immediate action in favor of deliberation. 
After a careful survey of the possibilities for resolving the con- 
flict, he may make a decision and then calmly and coolly plan 
to put his decision into effect At the other extreme he may make 
an immediate and vigorous attempt to free himself from the 
baffling situaiion which confronts him accoiding to his past habit 
systems In this case his impulsive behavior receives reinforce- 
ment fiom the sympathetic nervous system, and we may say that 
he is reacting emotionally If this is the method employed, there 
IS a minimum of thought used Usually one meets life’s difficul- 
ties by a combination of deliberation and emotion 

It must be evident from the above description that the “emer- 
gency” type of behavior has only a fair chance of being satisfac- 
tory If previous hab.t has provided a good adjustment to the 
' 400 
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novel and irritating situation in the past, then the response may 
be satisfactory But this method is blind Only when the various 
alternatives in the situation have been fairly surveyed can it be 
said that the response is reasonable or intelligent Emotional 
conduct IS notoriously blind and haphazard 

Because conduct accompanied by emotion has peculiar signifi- 
cance, It IS desirable that we have measures of emotional ac- 
tivity One method of measuring this emotional activity is by 
testing for conduct which results in poor adjustment The Wood- 
worth Psychoneurotic Inventory has been called a test of emo- 
tional stability As a test of the emotions this implies that all 
of the conduct, the presence of which is indicated on the question- 
naire, IS accompanied by emotion This is on the whole probably 
a fair assumption, but the relapon between the conduct and 
the presence of emotion may not be very direct. What is needed 
is a more direct measure of the visceral changes which are char- 
acteristic of the emotional, stirred-up state. 

Of the visceral changes which characterize the emotions, some 
are inaccessible or inconvenient to get at Rubber balloons to be 
swallowed by the subject have been employed to measure changes 
in the peristaltic movements of the stomach, duodenum, and rec- 
tum by Brunswick,* but the method is so distasteful to the 
subjects that its use is very restricted Likewise, detection of the 
presence of sugar or adrenin m the blood requires high technical 
skill. The changes which are most amenable to accuiate meas- 
urement are (a) the rate of the heart-beat, (b) the amplitude 
of the heart-beat, (c) blood-pressure, (d) volume of blood in a 
limb, (e) rate of breathing, (f) amplitude of breathing, and 
(g) psychogalvanic reflex. 

Before these methods are described in detail a few general 
words concerning laboratory techniques are in order The meth- 
ods of testing for the picscnce of emotion require elaborate and 
delicate instruments which must be caiefully set up in the labora- 
tory. Consequently the measurements must be carried out in the 
laboratory usually under somewhat artificial conditions Situa- 
tions whose effect on conduct are to be tested must be imported 
into the laboratory Since blood-pressure, pulse rate, breathing, 
and perspiration vary irregularly and seem to be influenced by 

• Brunswick, D , ‘‘The Effects of Emotional Stimuli on the Gastro-Intestinal 
Tone.” Journal of Comparative Psychology, 4 19-81^ 225-288 (1924). 
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a large number of obscure factors, all changes within a single 
individual must be noted during the experiment itself In papei- 
and-pencil testing it is possible to compare one individual with 
another, one gioup with another, or the response of an individual 
at one time with his response at another time But with these 
physiological measuies it is not possible to compare one indi- 
vidual with another or one group with another or an individual’s 
record at one sitting with his record at another sitting All com- 
parisons must be made between changes m the record during a 
continuous sitting when the environment is under control and 
relatively constant Even so, metabolic processes within the body 
or the influence of ideas produce changes which are impossible 
to control or interpret 

In ordinary clinical practice all observations are noted by the 
experimenter and recorded by him All observations represent a 
cross-section picture of a continuous process Pulse is counted 
for a fraction of a minute and recorded. Changes in pulse rate 
can be obtained by taking separate pulse counts and noting the 
change in rate Or blood-pressure is taken at a given moment and 
recorded Changes in blood-pressure aie determined by taking 
two or more separate readings and subtracting to determine the 
amount of change 

For accurate and precise work a continuous record is very 
desirable In the first place, if the record can be recorded me- 
chanically, one possible cause of error which occurs in observa- 
tion and the lecordmg of observation is eliminated In the second 
place, a continuous record permits much more precise infor- 
mation as to the time when changes occur, and the amount of 
these changes Smaller movements or changes that may have 
some significance will utterly escape the experimenter who must 
measure the process at infrequent intervals, but these can be ac- 
curately noted on a continuous record. 

The instrument used in physiology for recording such con- 
tinuous changes is known as the kymograph (curve-writer), in- 
vented by the famous physiologist Ludwig. This consists of a 
metal cylinder (A), called the drum, which may be made to 
revolve slowly at a constant and determined rate by means of a 
clock-work (B). Around this metal cylinder is wrapped and glued 
a sheet of glazed paper. This paper is smoked by moving it swiftly 
over a gas flame until it is coated with a thin, uniform layer 
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of soot This smoked paper surface is then used as a recording 
surface The recoid is made by rhe tracing of the point of a 
lever arm (tambour) which is moved by the phenomenon 
(pulse or breathing) being studied The point of the lever arm is 
brought lightly in contact with the smoked paper, and the record 
IS made by scratching a path through the sooty surface. This 
tambour or lever arm is placed at tight angles to the axis of the 
cylinder so that ordinates of the kymograph record represent 
the amplitude of the phenomenon being investigated, while the 
abscissae are time inteivals Several tambours may be in opera- 
tion at the same time so as to give a simultaneous record on the 
same kymograph sheet. Since the revolving mechanism does not 
usually record the speed of revolution it is necessary to have a 
special time marker or tambour which is made to move at regular 
intervals of a second or less by clock-work When the record is 
completed, the paper is slipped through a bath of diluted white 
shellac which coats the recoid and makes it permanent 

In more recent developments sensitive photographic paper has 
been used on the kymograph cylinder This is made to revolve 
behind a narrow slit A moving spot of light may be made to 
focus on this slit, thereby giving a record Or the lever arm may 
be placed m the path of a bright light and made to send its 
focused shadow on the slit where the shadow is recorded on the 
sensitive paper. This photographic method is used in recording 
the movements of the string galvanometer. 

Measurement of the Rate of the Heart-Beat — the Pulse 

The pulse may be measured simply by pressing the radial 
artery in the wrist and counting the number of beats for a given 

•The more simple tambours «oik by means of a system of levers Some- 
times, however, it is not convenient to transmit niov cmciits to the kymograph 
by means of levers In such cases, when the kymograph is some distance from 
the instrument, a device known as ‘Marcy’s tambour” is used This consists 
of a metal tray or basin covered with a rubber membrane This rubber mem- 
brane may be made to move up and down with certain pressure changes in 
the rubber tube which connects it with the measuring instrument The lever 
arm is connected (Jftectly with this rubber membrane and records its movements 

Sometimes two Marey’s tambours are employed, one of which is on the 
measuring instrument itself, the membrane of which is stimulated directly A 
rubber tube conveys changes in pressure in this first Marey’s tambour to the 
second at the kymograph some distance away The two rubber membranes 
must oscillate in unison ' 
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numbei of seconds An instrument for recording the pulse-beat 
IS known as the sphygmograph, which not only will give the pulse 
rate but furnishes a detailed pictuie of the character of the 
aiterial pulse. This lattei, however, is not needed in studying 
the emotions. 

The first spkygmogiaph was invented by Marey, and is rela- 
tively simple A pelotte or button is placed directly over the ladial 
artery in the wrist The movements transmitted to it are rela- 
tively small These must be magnified by a series of levers until 


♦ 

The Dudgeon Sputcmograpii in Position 
From Howell’s Text-Book of Physiology, W B Saunders Company, 1924. 

finally the movement is transmitted to the writing lever arm 
W'hich makes the record directly on the kymograph 

The Dudgeon sphygmograph is easier to use than the Maiey 
instrument It has two improvements One is a device by means 
of which the piessuie of the button on the arteiy can be grad- 
uated The other and most important variation is that the kymo- 
graph IS an essential part of the instrument and consequently 
the arrangement of levers can be better controlled Even in the 
best of these instruments, howevei, there is a certain amount 
of looseness in the lever joints giving an inertia which prevents 
the graphic picture from corresponding exactly with the pulse 
wave However, it is sufficiently accurate to afford a measure of 
pulse rate. 

Pulse rate has not been used extensively as a measure of emo- 
tional excitement, even though it is easy of determination, mainly 
because it is subject to such a variety of forces. The important 
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factors which influence pulse are given below, the first three men- 
tioned representing lather fixed adjustments 

1. Variations with sex The pulse rate of women is known to 
be higher than that of men at all periods of life 

2. Variations with size. The larger an individual the slower his 
pulse, other things being equal. This may in part explain the 
sex difference. Large animals have very definitely slower pulse 
rates than small animals, and some small birds are said to have 
pulse rates of several hundred beats a minute. 

5. Variations with age. As one gets older, the pulse rate grows 
less In extreme old age, however, there is a slight acceleration, 
due perhaps to selection or to shrinkage of body volume. The 
following table * gives the approximate average rates. 

Table 79 

Pulse Rates at V vrious Ages 



Pulse beats 


per minutes 

At birth 

140 

Infancy 

120 

Childhood 

100 

Youth 

90 

Adult age 

75 

Old age 

70 

Extreme old age 

7 S- 8 o 


4. Variations with temperature The pulse rate varies with 
the temperature of the blood As there is an increase in the tem- 
perature of the blood there is a coi responding increase in heart- 
beat. In cold-blooded animals the heart-beat is very sluggish 

5 Variations with the ptesence of dnigs in the blood stream 
Certain drugs, notably adienalin, will increase the rate of the 
heart-beat directly Other diugs, such as potassium salts, will 
retard or even entirely stop the heart action. 

6 Vaiiations loith exercise The next five factors are thought 
to influence pulse rate through the action of the nervous system. 
First, it IS well known that muscular exercise markedly affects 
the rate of the heart-beat This effect, while not ordinal ily no- 
ticed, may be produced by even very light muscular activity 
such as tapping a telegraph key It is probably this factor more 

• See Burton-Opitz, R , A Textbook 0} Physiology (W B Saunders Company, 
1920), p 273 
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than any other that makes It so difficult to use the changes in 
pulse rate as a measure of emotions This increased pulse rate 
is an adaptive mechanism for providing the muscular system 
with an increased supply of blood 

7. Fanatwns in postuie The pulse rate is higher when stand- 
ing than when sitting, and higher when sitting than when lying 
down This phenomenon may be included under the foregoing 
one, inasmuch as the difference is probably due to diffciences in 
tone of the postural muscles. 

S. Variations with eating. The pulse rate is increased after 
food IS received into the stomach This is believed to be a regu- 
lating mechanism to counteract the effect on blood-pressure of 
the laige vascular dilatation in the intestinal aiea 

g. Variations with swallowing Stalling * states that the act of 
swallowing causes a reflex quickening of the rate of the heart- 
beat by inhibition of the tonic vagus action. 

10, Variations with blood-pi essute There is an inverse rela- 
tion between blood-pressure and heart rate Low blood-pressure 
tends to quicken heart late, while high blood-pressure tends to 
make the heart beat more slowly This is an adaptive mecha- 
nism for equalizing the blood supply to the tissues undei differ- 
ences in blood-pressure 

II Venations due to emotion Besides all of these. factors in- 
fluencing heart rate, there is left the direct influence of emotion, 
the direct accelerating or inhibiting action of the neivous system 
It IS well known that the motor nerve impulses traveling ovei the 
vagus nerve tend to slow up the heait-beat, whereas the impulses 
passing out through the sympathetic nervous system tend to 
accelerate the heart-beat. These two sets of controls work in a 
truly antagonistic manner at all times, and thereby exeit a regu- 
lating mechanism on the heart rate Sometimes the sympathetic 
system is aroused directly by pcripheial stimulation, thereby 
causing an increase in the rate of the heart-beat But there is 
also an interrelationship between the various factors Exercise, 
blood-pressure, and presence of adrenalin in the blood stream are 
also associated with emotional excitement so that there is an 
interrelationship between the various factors controlling the pulse 
rate. 

* Starling, E H , Pnnciples 0/ Human Physiology (London, J, & A Churchill), 
P 803 
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Because the pulse is so sensitive to a number of factors, some 
of which are related to the emotions only remotely and some 
not at all, pulse rate is difficult to use as a measure of the 
emotions 


Measurement of the Amplitude of the Heart-Beat 


Besides the rate of the heart-beat, the aviphtude of the heart- 
beat is known to vary with emotional excitement Usually rate 
and amplitude go togethei As one inci eases, the other increases 
also But they do not always run in parallel this way Physi- 
ologists believe that different fibers from the vagus and also from 
the sympathetic system control rate and amplitude of the heart- 
beat. 

The sphygmograph, which gives a graphic record of the pulse 
wave, may be used to give a measure of the amplitude of the 
heart-beat as well as of the late. 

The factors which govern rate also in general govern ampli- 
tude. Under just what conditions one is accelerated or inhibited 
more than the other has been studied m some detail by Eng (4). 
Her findings are as follows. 


Attention 
Physical work 
Displeasing taste and 
smell 
Pain 
Fright 

Displeasure and excite- 
ment 

Pleasing smell 
Pleasing taste 
Pleasure resulting from 
other sensory impres- 
sions 


Pulse height 
Decrease 
Increase 
Decrease 

Decrease 

Increase followed 
by decrease 
High 

Increase 

Increase 

Increase 


Pulse rate 
Retarded 
Accelerated 
Accelerated 

Accelerated 
Accelerated, then 
retaided 
Accelerated and 
irregular 
Retarded 
Accelerated 
Retarded 


Measurement of Blood Pressure 

Some investigators, notably Marston and Larson, have made 
extensive use of blood-pressure in the measurement of emotional 
excitement and claim to have achieved satisfactory results. 

It is well known in physiology that a certain pressure is main- 
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tamed against the walls of the arteries due to the force of the 
heart-beat, the resistance in the capillaries, and the pressure of 
the aiterial walls This artetia! pressure is exceedingly sensitive 
to a number of factors, among which may be included emotional 
excitement. 

This aitenal or blood-pressure is measured by noting the 
force or piessure necessary to collapse the arterial wall and pre- 
vent the pulse beat from passing a given point. 



Sfhygmom<inoi>ietcr 


The instrument for causing this collapse of the arterial wall 
and measuring the pressure necessary to do this is known as a 
sphygmomanometer. (Pronounce this in two parts — sphygmo- 
manometer. Sphygmo- comes from the Greek meaning “pulse ” 
A manometer is an instrument used in physics for measuring 
pressure Consequently a sphygmo-manometer is an instrument 
for measuring blood-pressure.) The instrument commonly used 
IS known as the Riva Rocci sphygmomanometer, and is really 
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quite simple in construction and operation. It consists of three 
parts. The first part is for compressing an artery. This is usually 
accomplished by wrapping a band (nowadays usually made of 
silk) around the arm just above the elbow. Inside this band is 
a rubber bag which also encircles the arm This rubber bag is 
inflated w_ith air and by an increase in the pressure of the air 
in the rubber bag a corresponding increase of pressure is com- 
municated to the arm This pressure may be raised to such a 
point that the brachial artery is made to collapse, thereby pre- 
venting the pulse from coming through Pressure is raised by 
means of a rubber bulb squeezed by the hand 

The rubber tube by which the air pressure is communicated 
to the pressure bag on the arm is also connected with a manome- 
ter or pressure gage This consists of a U-tube of mercury One 
end of the tube is open to the air, the other receives the pres- 
sure. As the pressure is increased, it forces the meicuiy down 
on one side of the tube and up on the other The height to which 
the mercury rises may be read on a graduated scale placed 
behind the tube, and this reading becomes a measure of the pres- 
sure necessary to collapse the artery Sometimes a calibrated 
aneroid gage is used m place of the mercury manometer, but 
this instrument is not so accurate as the mercury manometer. 

In applying the instrument the fingers of the left hand are 
placed on the radial artery of the wrist while the right hand 
IS employed to inflate the rubber pouch At the moment that 
the pulse disappears a reading is taken which is known as the 
systolic pressure. Sometimes this is checked by raising the pres- 
sure above that necessary to stop the pulse and then noting the 
pressure reading at the point where the pulse beat first breaks 
through. In the auscultatory method {auscultation means “the 
act of listening,” hence the auscultatory method is the method 
whereby impressions are gained by listening), the pulse is ob- 
served by using the binaural stethoscope (A stethoscope is an 
instrument used by physicians and others for hearing more dis- 
tinctly than can be done with the ear alone sounds coming from 
within the body, particularly the chest Stetho- comes from the 
Greek meaning “chest.” The use of the stethoscope is not, how- 
ever, confined to the chest. Binaural means that the stethoscope 
conveys the sounds to both ears ) The stethoscope is applied to 
the brachial artery just below the point where the pressure band 
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of the sphygmomanometer is applied. The pressure is raised above 
the systolic level and then allowed to fall slowly by means of 
a needle valve. At the first point when the sound breaks through, 
the manometer reading should be recorded as the systolic pres- 
suie. By allowing the pressure to drop still further the diastolic 
pressure may also be approximately observed Ettinger, who has 
studied the method extensively, notes five phases of the sound 
in the stethoscope as the pressure is allowed to fall. 

1. The initial clear, sharp sound — systolic pressure. 

2. The sounds become more continuous, muffled, like a mur- 
mur. 

3. The sounds become progressively clearer and louder. 

4. The sounds become muffled and dull. 

5. The sounds disappear. 

According to later workers, the diastolic pressure is indicated 
most accurately by tlie transition from phase 3 to phase 4, at 
the beginning of phase 4. 

Systolic pressure is usually employed in testing for the presence 
of emotion. 

It will be noted that this method of observing blood-pressure 
does not permit a continuous reading Indeed, it is only by means 
of a somewhat complicated apparatus, known as the Erlanger 
sphygmomanometer, that the readings for a single testing of 
blood-pressure can be recorded on a kymograph. 

Koll, however, has described an instrument for obtaining a 
continuous reading of blood-pressure. Roll’s (15) method and 
apparatus for giving a continuous record of blood-pressure by 
the indirect method are not very well known. Yet the principle 
involved is so simple as to warrant more extensive experimenta- 
tion with the method. 

Koll reports that continuous observations may be made for 
fifteen or twenty minutes, after which the continued pressure 
on the arm causes considerable discomfort. Ordinarily in using 
the instrument one takes a tracing for four or five minutes, de- 
flates the pressure for a minute or so, and then resumes the 
tracing 

The factors which influence blood-pressure are many indeed. 
They will be enumerated in order with little comment It may 
be seen by following,, through the various factors that blood- 
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pressure is one of the major factors in determining daily or even 
hourly efficiency. In the following summary the pressure re- 
ferred to IS that in the arteries 

I Blood-pressure vanes directly with the volume of the blood. 
Extensive blood-letting is accompanied by a decided fall in pres- 
sure. 

2. Blood-pressure varies directly with the energy of the heart, 
other things remaining constant. The pressure increases with an 
increase in the rate and force of the heart-beat. In accurate meas- 
urements of blood-pressure in animals, in which an artery is 
opened and connected directly with a manometer, it is found 
that the pressure varies with the cycles of the heart-beat. The 
pressure is strongest at the time of the ventricular systole. 

5 Blood-pressure vanes with the elasticity of the blood-vessels. 
This IS partly a true elasticity of the tissues constituting the 
arterial wells and is partly a state of tonus of the smooth mus- 
culature which form a part of the wall of every artery. 

4. Blood-pressure vanes with the peripheral resistance. That 
is, blood-pressure is controlled m part by the resistance of the 
smaller arteries and capillaries farther along in the arterial 
system 

5. Gravity causes variations in blood-pressure. When one is 
erect this force tends to increase piessure in arteries at levels 
below the heart and to decrease it at levels above the heart Upon 
lying down a distinct change in the relative pressure is effected. 

6. Blood-pressure is higher on the average in man than iri 
woman. 

7. Blood-pressure varies with the size of the animal. In general 
the larger the animal, the higher the pressure. 

8. Blood-pressure increases with age, as is shown in the fol- 
lowing table taken from Burton-Opitz In old age blood-pressure 
increases due to the loss of elasticity of the vascular tissue 

g. Breathing causes variations in blood-pressure. As the breath 
is taken in, suction is exerted upon the blood-vessels in the 
thorax and the large veins near the heart, causing a fall in pres- 
sure. The converse effects are noted when the breath is expelled 
The breathing consequently causes small, undulating changes in 
pressure throughout the arterial system. Deep and forced breath- 
ing increases the pressure in general. 
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Table So 

Br onn-PRESsuRES at Various Ages 
(from Burton-Opitz *) 


First few months .. . . 


Blood-pressure in 
millimeters of 
mercury 

70- 75 

1- 2 years 

. 

80- 90 

2- 3 years 


90-100 

3-10 years 


9S-IIS 

lo-rs years 


100-1 IJ 

15-20 years 

. 

105-128 

20-30 years 


13s 

30-40 years 


140 

40-50 years 


142 

50-60 years 


154 

60-70 years 


180 


10 A rhythmical nse and fall of blood-pressure is known as 
the Traube-Henng waves Each of these waves is longer than 
those due to respiratory movements. These waves are believed 
to be due to a rhythmical action of the vasomotor center. 

ij. Blood-pressure is higher after food has been taken into' 
the system This may be due to a concentration of blood m the 
abdominal organs, while in the splanchnic area the pressure is 
lowered. 

12 Muscular exercise causes a nse in blood-pressure To this 
may be coupled other factors Blood-pressure is lower during 
sleep, probably largely because of the cessation of muscular ac- 
tivity. Blood-pressure rises during mental work While this may 
represent a distinct phenomenon, it probably is due to the in- 
creased muscular activity which is concomitant with mental labor. 

15 Cold baths as well as hot baths produce a rise in blood- 
pressure, although for a very different reason In the one case 
there is a marked constriction of the surface arteries, in the 
other the increase is due to an increase In the frequency of the 
heart Baths at the temperature of the body have no influence 
on blood-pressure 

14. Blood-pressure falls with fatigue. Such substances as lactic 
acid and carbon dioxide, waste products of muscular activity, 
cause a relaxation of muscular tonus and produce a dilation of the 
arteries, thus providing more blood at a time when it is needed 

* Burton-Opitz, R, A Textbook of Physiology (W B Saunders Company, 
1920), p 370 
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On the other hand, it is well known that the presence of carbon 
dioxide in the blood causes a rise in arterial pressiiie through 
the vasomotor centers Starling harmonizes these two opposite 
influences as follows' 

“We thus see that carbon dioxide, which is the universal hor- 
mone set free m the circulation when the activity of the body 
as a whole is increased, has a double effect on the blood-vessels 
— a central effect through the vasomotor centers, medulla, and 
spinal cord, causing contraction of the blood-vessels, and a local 
peripheral effect causing dilatation of the blood-vessels. The gen- 
eral result therefore will be to cause dilatation of the blood-vessels 
of the part where the carbon dioxide is produced and where it is 
present in greatest concentration, and vascular constriction else- 
where under the influence of the sensitive nervous centers ” 

15 Blood-pressure is loweied during menstruation and raised 
during pregnancy, and is remaikably higher during labor. 

16. Pam causes a rise in blood-pi ersuie 

If. Cold produces a rise in blood-pi essure. 

18. Certain glandular products cause marked changes in blood- 
pressure. Adrenalin produces a marked constriction of the ar- 
terioles and a rise in blood-pressure Thyroid deficiency causes 
a drop in blood-pressure, while thyioid extract, when fed, causes 
a faster pulse and higher blood-pressure Likewise pituitary ex- 
tract causes a rise in blood-pressuie 

jp. Finally we come to the effect of emotion on blood-pressure. 
In general emotional excitement causes a use in blood-pressure. 

It should be observed that theie is an intricate intci dependence 
among these various factors The intioduction of one factor as a 
stimulus causes a complex leadjustment in the body For in- 
stance, under the influence of an emotional stimulus blood-pres- 
sure is directly increased But the heart rate and force also 
increase, breathing increases in rate and in amplitude, and 
adrenalin is poured into the blood stieam These reactions also 
have their own effect on blood-pressure Blood-pressure in turn 
has Its influence on the other factors The final outcome is an 
intricate adjustment and balance of the various reciprocal factors. 

It should also be observed that while on the whole the vascular 
system works as a unit, there are special controls for distributing 

•Starling, E H, Principles of Human Physiology (London, J Sc A Churchill), 
P 850 , . 
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the blood supply to different parts of the body. In general there 
are vaso-constriclor and vaso-dilator fibers which work antago- 
nistically. Concerning this Howell says, 

“It may be supposed that under normal conditions the activity 
of this mechanism is adjusted so as to control the blood-flow 
through the different organs in proportion to their needs. When 
the blood-vessels of a given organ are constricted the flow thiough 
that organ is diminished, while that through the rest of the body 
IS increased to a greater or less extent corresponding to the size 
of the area involved in the constriction. When the blood-vessels 
of a given organ are dilated the blood-flow through that organ 
IS increased and that through the rest of the body diminished 
moie or less The adaptability of the vascular system is wonder- 
fully complete, is worked out mainly through the reflex activity 
of the nervous system exerted partly through the vasomotor 
fibers and partly through the regulatory nerves of the heart.” * 

It must be evident from all of the foregoing that it is difficult 
to use blood-pressure as a simple indication of the pressure of 
emotion. So many factors operate to influence blood-pressure that 
conditions must be very carefully controlled if the presence of 
emotion is to be detected. One thing is certain — when this is 
our purpose, we cannot compare’ blood-pressure obtained at one 
sitting with pressure at another sitting, or pressure in one person 
with that in another person In experimental work all compari- 
sons must be made between changes in the same individual dur- 
ing the same sitting, so that all factors may be under rigid 
control Even at best there is certain to be variation in the 
forces acting, due perhaps to metabolic processes going on within 
the body or to the emotional influence of stray thoughts Experi- 
menters report just such unexplainable variations. If blood- 
pressure is to be used in the study of emotion, it must probably 
be used when the emotion is so very intense as to outweigh in 
effect any of the lesser variable forces acting. 


Measurement of Blood Volume 

Still another index of the changes in the circulatory system 
is a measure of the blood supply in an organ, as determined by 
measuring the volume of some part of the body, such as the 

•Howell, W 'R , A Textbook of Physiology (W B. Saunders Company, 
^929)1 P 622. 
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hand and forearm The measurement of volume is complementary 
to measurement of pressure and pulse. As the arteries and ar- 
terioles in an organ become constricted, with consequent diminu- 
tion in the volume of the organ, the pressure on the aiterics 
supplying blood to that organ becomes higher Conversely, as 
the volume becomes greater, the pressure becomes less Also as 
the pulse increases in rate and amplitude, the volume in the 
peripheral musculature tends to increase, and vice versa Since 
emotion has an influence on the vasomotor system, the measure- 
ment of the volume of an organ is another impoitant means of 
estimating the presence and amount of emotion 

The instrument for measuiing blood volume is called a plethys- 
mogi-aph (pletliysmo- fiom the Greek meaning “enlargement”). 
The plethysmograph in reality is an instrument for measuring 
the volume of a limb, but because changes m the volume of a 
limb are due to changes in blood volume, the instiument becomes 
an instrument for measuring blood volume The most common 
form of the plethysmograph is that used by Lehmann, which 
measures the volume of the foiearm and hand The following 
description is borrowed fiom Eng (4, pp 5-7), who made prac- 
tical use of the Lehmann plethysmograph in her experimental 
work. 

“The apparatus employed to lecord the volume-pulse curve 
of the arm was an impioved foim of plethysmograph devised 
by Lehmann It consists of a metallic cylinder open at one end. 
Inside the metal cylinder lies a sleeve (P) of the finest soft 
rubber which is attached to the 11m of the cylinder, forming a 
water-tight union During the expeiiments the arm rests in the 
rubber sleeve, the space between the sleeve and the cylinder 
IS filled with water, so that the sleeve is moulded to the hand 
by the pressuie of the water and coveis it like a glove The water 
IS admitted through a small opening at the farther end of the 
cylinder w^hich is connected by means of a short pipe and a 
piece of rubber tubing to a glass funnel fixed at the requisite 
height on a stand A short pipe from the side of the cylinder 
opens at the top into a tube 10-14 cm long and of about i cm 
in diameter, which acts as a water manometer (V) The tube is 
closed above by a rubber stopper through which a piece of glass 
tubing passes, and the glass tubing is connected by means of 
rubber tubing to the recording apparatus 

“The water flows into the plethysmograph until it reaches up 
into the glass tubing, every further expansion of the arm (whether 
It be caused by individual pulse beats jOr by more marked 
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changes) drives the water somewhat higher up, and is thus 
made visible by the change in the surface-level of the water If 
the volume decieases, the water-level sinks correspondingly The 
changes of piessure thus produced are transmitted through the 
air in the lubbei tubing to the membrane of the recording ap- 
paratus, and are inscribed on the kymograph. The necessary 
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Plethi smocr \ph 

From Eng, Ilclga, E\perimental Inve^tigahons Into the Emotional Life r 
the Child Compated with That of the Adult (London, Humphrey Milford Ox- 
ford University Press. 1925, New York, Oxford University Press), p 6 


pressure on the arm is adjusted according to the readings of 
the watei manometer, it must not be too weak or the rubber 
sleeve will not envelop the arm closely enough, nor must it be 
too strong, as in that case it would give rise to an ill-defined 
curve The coriect amount of pressure for the best curve must 
therefore be decided by trial The pressure is regulated by raising 
or lowering the funnel, which is therefore called the manometer 
funnel ” 

Eng describes certain refinements in the method based on her 
experience, and her report should be consulted by anyone plan- 
ning to experiment with the apparatus. 

The factors which control blood volume are similar to those 
controlling pulse rate and height and blood-pressure, since volume 
is a resultant of these other phenomena working in combination. 
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Eng notes the following changes in volume due to various 
forms of activity and emotional excitement 


Attention-tension 
Psychical work 
Displeasing taste or smell 
Pam 
Fright 

Spontaneous displeasure due to sen- 
sory impressions 

Psychical displeasure (unpleasant 
concepts) 

Displeasure and excitement 
Depression 
Pleasing smell 
Pleasing taste 

Spontaneous pleasure from sensory 
impressions or concepts 


Falling 
Rising 
Falling 
F ailing 

Rising followed by falling 

Marked falling 
Low with abiupt rising 
and slower fallings 

High 

Low and even 

Rising 

Rising 

Rising 


Measurement of Rate and Amplitude of Breathing 

Breathing is another of the organic functions which shows 
marked change under conditions of excitement Since breathing 
is the portal for the purification of the blood and its renewal 
with oxygen, it has an important place in the metabolic cycle 
When a sudden and strong stimulus arouses the sympathetic 
nervous system to action, thus putting the body into condition 
for immediate and vigoious activity, among the other prelim- 
inary adjustments is a quickening of the breathing and an increase 
in Its amplitude 

The rate of bieathing may be noted simply by counting the 
number of inspiration movements of the chest during a minute 
or part thereof This simple method, however, does not leave 
behind its own record When a record of the rate of breathing 
is transferred to the kymograph, it is convenient at the same 
time to make a record of breathing amplitude, so the measure- 
ment of the two phenomena will be considered together 

The instrument for recording the characteristics of the re- 
spiratory movements is known as a pneumograph The form usu- 
ally employed was originally designed by Marey In its simplest 
(and perhaps most satisfactory) form it consists of a stout, 
flexible rubber tube which may be stretshed at will This is 
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stretched part way around the subject’s chest, being held in place 
by a light chain and hook One end of the tube is closed, the 
other end has a nipple, to which is attached the tube connecting 
the pneumograph with the kymograph tambour. As the subject 
breathes, the rubber tube is stretched, air being forced out This 
change in air volume is transferred to the rubber membrane of 
the tambour. 



PSEUMOCRAPH 

A more complicated form of Marey’s pneumograph consists 
of a plate of rather flexible metal so strapped to the subject’s 
chest that it bends as the breath is inhaled and exhaled, thus 
causing changes in a Marey’s tambour fixed directly above the 
plate by a series of levers These changes in pressure are then 
transmitted by a rubber tube to a second Marey’s tambour at 
the kymograph, where they are recoided. 

The factors causing changes in respiration rate and amplitude 
are many Although many of them are similar to those which 
cause changes in pulse, blood-pressure, blood volume, etc , others 
are different, so that a review of these factors will be helpful 
There are no authoritative descriptions of conditions where rate 
and amplitude of respiration do not go together. Usually changes 
in one produce changes in the other. 

The average rate of respiration is i8 per minute 
I Rate of respiiatxon changes with age. Table 8i, taken from 
Burton-Opitz, gives the averages for different ages 
2 . Rate of respiration vanes vnth the size of the animal. In 
general the smaller the animal, the faster its rate of breathing. 
This is due to the fact that “smaller animals possess a more ex- 
tensive body-surface in relation to their mass than the larger 
ones, and hence suffer a much greater loss of body heat.” 

5 Changes in respiration are directly connected with changes 
in heart-beat, blood-pressure and the like. Usually both are re- 
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Table Si 


Rate of Breathing at Various Ages 


Age 

Ratt of brt<!thng 

New born 

62 

0- I 

44 

S-iS 

26 

15-20 

20 

20-25 

187 

25-30 

IS 

30-50 

17 


suits of the same cause, but there may be a direct reciprocal rela- 
tion. 

4. Respiration is directly proportional to the CO2 tension of 
arterial blood. This is the primary factor producing momentary 
variations in breathing. Many of the factors which follow can 
be traced back to this influence of the blood composition. 

5. Rate and amplitude of respiration aie increased with an in- 
crease in carbon dioxide content of the air. The increase in the 
carbon dioxide content of the air must be considerable before any 
effect IS noted This is probably due to the inadequate washing-out 
of the carbon dioxide in the blood stream. 

6. Respiration increases with increased barometric pressure. 
A corresponding increase in the depth of respiration is noted when 
the pressure is reduced in high altitudes High barometric pres- 
sure as in caisson ivork, or low pressure as in airplane flights, 
produces a variety of physiological effects, of which change in 
breathing is only one. 

7. Rate and amplitude of breathing increase with exercise This 
phenomenon is too common for comment 

8. Breathing is gi eater when erect than when reclining This is 
probably connected with muscular activity. 

p. Breathing decreases in sleep. 

10. Breathing incieases when speaking, for obvious reasons. 

11. Breathing increases with increases in heat both of the body 
and of the surrounding air. This is probably associated with the 
increased rate of metabolism in the body. 

12. Breathing increases momentarily when one is dashed with 
or immersed in cold water. 

13. Breathing increases mth pain. 

14. Breathing may be increased or decreased in rate and am- 
plitude by an act of will. * 
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75. Breathing is increased under the influence of emotion. 

Eng's (4) expenmental ■vvoik also yields data concerning the 
changes in bieathing produced by vaiious kinds of affective 
stimuli. 


Attention-tension 
Psychical work 
Displeasing taste and smell 
Pam 
F right 


Respiration 

Retarded 

Accelerated 

Retarded 

Accelerated 

Retarded, followed by 
acceleration 

Small effect 

Small effect 

Unchanged 

Retarded 

Accelerated and shallow 
Accelerated 


Spontaneous displeasure due to sensoiy 
impressions 

Psychical displeasure (unpleasant con- 
cepts) 

Displeasure and excitement 

Depression 

Pleasing smell 

Spontaneous pleasure from sensory im- 
pressions or concepts 


The same cautions that have been mentioned concerning pulse 
and blood-pressure apply to the use of changes in breathing as 
a measure of emotion Only marked changes are significant, and 
It IS not possible to make use of norms or the changes from one 
day to another, since all changes must be noted during the same 
sitting Even so, extreme care must be used in not intioducing 
any exciting change except that planned in the experimental 
procedure. 


Measurement of the Psychogalvanic Reflex 

Landis and De Wick (56), in an extensive review and sum- 
mary of the literature beaiing on the psychogalvanic reflex, tell 
us that the first mention of the “electrical tension of the skin” is 
to be credited to Bertholon in 1786, while Fere is ordinarily given 
the credit as being the first to demonstrate the psychogalvanic 
reflex in 1888 The early workers — ^Vigouroux, Fere, Tarchanoff, 
Sommer, and Veraguth — discovered most of the phenomena con- 
nected with the reflex and prepared theories in explanation of it. 
Vigouroux (98) in 1888 concluded that the electrical resistance 
of the body is a function of the vasomotor system. Fere (30) in 
the same year first ^called attention to the variations caused by 
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emotional changes. Tarchanoff (89) in 1890 attributed this elec- 
trical change to the secretory activity of the sweat-glands 

It has been found that if two points on the skin are connected 
so as to form an electric circuit, a current passes through them, 
due to a difference in potential at the two points on the surface 
of the body. This difference in potential is relatively large if one 
of the spots IS a region of the skin rich in sweat-glands and the 
other a region relatively devoid of them, the difference in poten- 
tial IS small or absent for symmetrical points of the body, such 
as the corresponding spot on the two arms Changes in mental 
activity, excitement, etc, will cause a change in the amount of 
current flowing through the circuit If a small current be passed 
through the circuit from an outside source of constant voltage, 
the amount of this current will change with changes in emotional 
excitement. The electrical variations of the skin under emotional 
excitement without an exterior electromotive force is known as 
Tarchanoff’s phenomenon The same variations when an outside 
current is added is known as Fere’s phenomenon In the latter 
case the electrical changes due to emotional excitement are much 
greater. It is generally believed, however, that the changes in the 
two cases are both due to the same phenomenon 

In experimental work the amount of outside current to be used 
has been the subject of debate. Some have used none, although 
without an outside source of current the emotional effects are 
small and a correspondingly more sensitive galvanometer must be 
used On the other hand, there is difficulty in maintaining a con- 
stant voltage and resistance in the system when an outside cur- 
rent IS employed The majority of expeiimenters use from two to 
three and one-half volts. 

The amount of electrical current is measured by a galvanome- 
ter. There are many kinds of galvanometers The kind generally 
employed in the extremely delicate work of measuring the elec- 
trical currents of the body is known as a D’Arsonval galvanome- 
ter (varrously called a moving coil galvanometer or a mirror gal- 
vanometer) The principle on which this galvanometer is based 
is the relation between the current passing through a coil of wire 
suspended between the poles of a stationary magnet and the lines 
of force set up tending to move the coil The coil of wire becomet 
an electromagnet when a current is passed through with poles 
at right angles to the poles of the stationary magnet. This sets 
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up a force tending to turn the coil and make the electromagnetic 
field in the coil become paiallel to the field of the permanent mag- 
net In the D’Arsonval galvanometer there is a stationary horse- 
shoe magnet Between the poles is suspended a coil of fine wire. 
The current enters the coil from below on a loose spiral wire, 
and leaves at the top by the fine wire on which the coil is hung. 
The coil IS thus free to move between the poles of the magnet. 
Usually there is a stationary bar of soft iron in the space within 



D’Arsonval Galvonometer 


the frame on which the coil is wound to help concentrate the 
magnetic lines of force The amount of twist or movement of 
the coil IS a measure of the amount of current passing. 

On the wire by which the coil is suspended is attached a mirror 
which IS made to reflect the readings from a scale If this mirror 
IS viewed through a telescope having a fine vertical hair line, 
readings of the scale may be used as measures of the current. 
A beam of light may also be reflected onto a moving sensitive 
photographic film, thus giving a continuous record. 

A Wheatstone bridge (an instrument for measuring electrical 
resistance) is usually placed in the circuit, both in order to meas' 
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ure the resistance offered by the body to the passage of the cur- 
rent, and also for the purpose of bringing the galvanometer needle 
to zero at the beginning of the experiment. The Wheatstone bridge 
is not necessary, however, yj the experiment. 

Wechsler (no) found that the electrical resistance of the body 
varies during the progress of an experiment, often in an unpre- 
dictable way. He believed it to be desirable to make some sort 
of adjustment of the exsomatic (from outside the body) current. 



CIRCUIT WITH SUBJECT IN SERIES 

CIRCUIT WITH SUBJECT IN WHEATSTONE BRIDGE 

D. Sw - DOUBLE SWITCH S'SUBJECT R B-CALIBRATED RHEOSTAT 

E, e'- ELECTRODES 

Electric Connections tor Apparatus for Measuring the Psychogalvanic 

Reflex 

From Wcclislcr, Daiid The Measmcment of Emotional Reactions, Archives of 
Psychology, No 76, p 127 


This he accomplished by a potenUometei (a rheostat which per- 
mits variations in the amount of cuirent allowed in the circuit 
and inadvertently, as it were, measures the amount of current), 
which IS manipulated during the process of the experiment when 
it is noticed that the galvanometer needle does not go back to 
zero when at a state of rest. This does not seem to the writer 
to be defensible experimental technique If such a procedure is 
necessary, it is testimony to the grave difficulty of using the psy- 
chogalvanic reaction as a measure of emotion at all. 

Godefroy (41) employed an interesting circuit which includes 
a transformer. The subject is in circuit iij the primary coil, while 
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the galvanometer is in the secondary coil circuit. In this circuit 
the galvanometer needle goes back to zero of itself after every 
change of the cuirent, so that variations in the body resistance 
do not act as an influence on the galvanometer. This is a happy 
solution of the vexing problem of allowing for variable resistance. 

The latest device for measuring small electrical currents very 
sensitively is the cathode-ray oscillograph, an instrument with 
which alternating current must be employed. The most simple of 
these instruments is the Brown tube This is an evacuated glass 
bulb A cathode ray is caused to pass through the tube, striking 
at the further end of the tube a zinc sulphide layer which becomes 
illuminated when the rays strike it. Two plates are placed on op- 
posite sides of the tube and connected in the psychogalvanic cir- 
cuit. As the current passes through these plates, they bend the. 
electric stream and cause the spot of light on the zinc sulphide 
disk to move This spot of light can be picked up photographically, 
whence it becomes a measure of the current passing through the 
body 

The question of the electrode to be used in the psychogalvanic 
reflex experiment is very difficult (An electrode is the plate or 
terminal of an electric current In this case it is the material used 
to make the contact with the body which completes the electrical 
circuit ) Wechsler gives three conditions to be considered in the 
choice of electrodes (a) the part of the skin to which they are 
applied, (b) the security of the contact, (c) the degree of polar- 
ization of the electrode. All electiodes polarize. (By polarization 
IS meant the accumulation on the surface of the positive electrode 
of some substance (a layer of ions) which increases the resistance 
in the circuit This polarization commences the moment the circuit 
IS completed and the current commences to pass, and acts con- 
tinuously to increase the resistance ) A variety of electrodes have 
been tried in the experimental work Some workers have used dry 
metal plates of brass, zinc, copper, tin, bronze, etc Others have 
tried liquid solutions. Wechsler had his subjects immerse two 
fingers of the same hand in porous clay cups filled with salt solu- 
tion and containing small plates of amalgamated zinc. The clay 
cups in turn were set in small glass jars filled with zinc sulphate 
The size of the electrodes also has been given experimental con- 
sideration In general it has been found that the greater the 
electrode surface, the less effective is the polarization. ^ 
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Several iateresting features of the psychogalvanic reflex may 
be noted For the first ten or fifteen minutes aftei the electrical 
circuit IS closed, there is a diminution or fall in resistance, so that 
in experimental procedure it is well to ait until the galvanometer 
needle has come to rest This phenomenon is said to be due to 
an initial polarization of the skin A second feature of the reflex 
IS a latent period of from two to three seconds after a stimulus 
is given A third characteristic of the psychogalvanic response is 
a small negative deflection which usually precedes the main de- 
flection. The presence of this phenomenon when all conditions 
for the reflex are right is doubted by some investigators, who 
attribute it to poor electrode contact, slight muscular contractions 
in the region of the electrode, or vaso-dilation Another unex- 
plained phenomenon is a small negative deflection following a 
positive deflection, a phenomenon which is so small, however, 
as never to be serious in practical work. 

Explanation of Psychogalvanic Phenomena 

That the skin should possess electrical properties is most in- 
teresting, and naturally many theories have been advanced to 
explain the phenomenon After their careful and exhaustive re- 
view Landis and De Wick (56) conclude that, “Conflicting evi- 
dence foiccs us to wait for fuither, better controlled, and more 
scientific experimentation before accepting any definite physiologi- 
cal explanation” Three theories, m geneial, have been advanced 
in explanation of the psychogalvanic phenomena, which may be 
called respectively the sweat, the vasomotoi, and the viuscular 
theories. 

Jeffress (48, p 141) is the best exponent of the sweat theory, 
which he desciibes as follows. 

“If we consider the interior membrane of the sweat-glands as 
semipermeable, we may desciibe the phenomena of secretion as 
an alteration of permeability which permits the passage of the 
perspiration from the interior of the sweat-glands into the tubules. 
If we further assume that the membrane is permeable to only 
one of the ions in the sweat, which is already in the tubules, 
or in the electrolyte of the electrodes, we may explain the phe- 
nomena of polarization through the skin If the membrane is 
permeable to only one of the 10ns, say the cation, the othero 
collect at the membrane and oppose the flow of current; in other 
words, the membrane becomes polarized a/id a counter-electro- 
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motive force is set up opposing the flow of current Now if upon 
stimulation (neivous), the permeability of the membrane is 
changed so that it is more or less permeable to both 10ns, the 
cunent would flow with less difficulty; le, the counter-electro- 
motive force would be reduced This, as we have seen, is what 
actually occurs upon stimulation ” 

Jeffress goes on to show that the nervous innervation of the 
sweat-glands is itself an electrical phenomenon (48, p 142) . “We 
should expect, then, that stimulation of the sweat glands would 
exhibit Itself by an electrical manifestation or ‘action cuirent’ of 
the gland itself Several expeiimcnts (Waller and Mendelssohn) 
have shown that activity of the sweat-glands is accompanied by 
an increase in electrical potential directed from the surface of the 
skin inward.” 

Jeffress himself found that the hand, which is relatively rich in 
sweat-glands, becomes more strongly negative to the indifferent 
parts of the body (mouth) when the subject is stimulated, lead- 
ing him to believe that it is a case of the action currents of the 
sweat-glands. 

This theory has much to commend it. 

The vasomotor theory has its strongest champions to-day in 
McDowall and H M Wells (19, 20, 59, 113, 114) These experi- 
menters have demonstrated a very close correspondence between 
the constriction of the blood vessels of the skin and the psycho- 
galvanic reflex On the other hand they cite cases from their data 
in which profuse sweating under the influence of drugs yielded 
only a normal needle deflection. The evidence is sufficient to 
indicate that there is a very close connection between the vaso- 
motor system and the psychogalvanic reflex, but this connection 
may be correlative only, since changes in the vasomotor system 
accompany excitement changes in general However good the 
grounds for believing that the vasomotor system may be in part 
responsible for the psychogalvanic reflex, we cannot at present 
understand the connection. 

The muscular theory seems less tenable, and has not been 
stressed in the most recent work. 

Factors Influencing the Psychogalvanic Reflex 

The use of the psychogalvanic reflex for the measurement of 
emotion is beset by^ difficulties, for besides the outside stimuli 
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which cause physiological variations, the factors which cause varia- 
tions in an clectiical cuirent must be considered 

I Variation in the bodily resistance is one of the major diffi- 
culties in using the psychogalvanic icflcx for the study of emotion. 
The electrical resistance of the body vanes in an unpredictable 
way duiing the course of an experiment, and quite apart from 
changes caused by the experimental stimuli presented 

2. The polarization of the electiodes influences the needle de- 
flection. The greater the polarization, the gi eater the resistance 
and the less the deflection Relatively unpolarizable electrodes are 
the most satisfactory. 

j The size of the electrode influences the needle deflection by 
making polaiization less effective. 

4. The points on the skin where the electrodes are applied have 
a marked effect on the needle deflection In general the reflex is 
more pronounced when one of the electrodes at least is connected 
with a region tich in sweat-glands 

5. The secuiity of contact of the electiodes influences the size 
of the deflection 

6. T he intensity of the current influences the size of the reflex. 
Within limits, the stronger the exsomatic current used, the greater 
the extent of the galvanometer needle deflection It is essential, 
therefore, that the strength of the current be constant duiing an 
experiment, unless proper allowance and correction are made. 

7. Muscular movement influences the tefiex. Many investigators 
have noted the relation between muscular activity and the psycho- 
galvanic reflex Indeed some have tiied to see a direct causal 
connection between muscular activity and the electrical resistance. 
One can understand, however, how muscular activity has its in- 
fluence on the peripheral blood vessels and the sweat-glands 
through the sympathetic neivous discharge, and it seems unneces- 
sary to hypothecate a direct connection between muscular activity 
and the reflex. 

<y. The length of time the exsomatic current flows has an in- 
fluence on the reflex Wechsler (no) gives evidence to show that 
m cases when a low intensity current was used with an electrode 
placed on a spot rich in sweat-glands (the palm of the hand), 
the resistance increases with time. This is probably a polarization 
effect On the other hand, when high intensities of current were 
tried, the resistance fell off with time, which Wechsler attributes 
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to inhibition (absorption) Between these two extremes there is 
every possible variation Wechsler finds that with low-voltage cur- 
rents applied to the finger-tips, the two opposing tendencies are 
neutralized and the resistance remains more nearly constant 

p. Peterson (62), Bmswanger (24), and hloravcsik (60) note 
an increased deflection when pressure is applied to the electrodes 
H M Wells (59) on the other hand reports that pressure on the 
electrodes may diminish or even prevent the reflex. Further re- 
search IS needed on this point. 

10 Resistance to alternating current is less than to diiect cur- 
rent, and does not produce such large fluctuations. 

11. Gildemeistcr reports that the positive variation of a con- 
stant current passed through the skin on stimulation of the skin 
surface is accentuated by raising the temperature of the electrodes, 
and that this local warming increases both the local time and the 
duration of the reflex. Landis (55), on the other hand, repoits no 
lelationship between temperature changes of the skm-electrode 
picture and the resistance of the body. It is conceivable that, if 
the sweat-glands are stimulated to action by heat, the reflex could 
increase with a rise in general surrounding temperature. Sidis 
and Kalmus (77) find that heat and cold have little effect on 
the psychogalvanic reflex, while Wells (114) reports that it is 
difficult to elicit the reflex m cold weather Evidently fur- 
ther investigation must be looked to before this point can be 
settled. 

12. Fatigue diminishes the reflex. In this connection Wechsler’s 
summarized statement of the diurnal variations of the reflex may 
be cited “The human body resistance tends to be least toward 
the middle hours of the day and greatest at night and in the early 
morning hours ” The lessened resistance during the day may be 
attributed in part to muscular activity and the increased lesist- 
ance in the evening peihaps may be explained as a fatigue phe- 
nomenon No explanation can be given for the high resistance in 
the early morning. 

jj Closely related to the foregoing is the decrease in the gal- 
vanometne deflection caused by repeated stimulation of sense- 
organs. 

14 Deep breathing provokes a psychogalvanic reflex. This fac- 
tor probably operates through the sympathetic nervous system, 
leading to changes in the surface arteries and sweat-glands. 
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15 Mental activity causes a diminution of the psychogalvai^ic 
reflex, thus acting in the direction opposite to that of muscular 
activity On the other hand hard mental work accompanied by 
strain or effoit inci cases the reflex 

16. Any direct increase in stimulation of the sweat-glands stimu- 
lates the psychogalvanic reflex and vice versa. Certain instances, 
exceptions to this rule, are noted m the experimental literature. 
Wells (59) reports a case in which profuse sweating caused by 
diug action was actually accompanied by a decreased psycho- 
galvanic reaction Such cases are rarely reported, however, and 
the testimony is ovci whelming that the leflex is positively corre- 
lated with sweat secietion. 

17. Drugs have a marked influence on the psychogalvanic re- 
flex, but here the reports in the experimental literature are ex- 
tremely conflicting 

18. Many investigators have reported that the psychogalvanic 
reflex can be elicited by sensory stimulation alone Starch (82) 
and Veraguth (92) believe that the amplitude of the needle de- 
flection conesponds loughly to the strength of the stimulus Since 
repeated stimuli cause the reflex to become less, it is evident that 
the psychogalvanic reflex follows the usual laws of attention and 
fatigue 

ig. The evidence as to whether the psychogalvanic reflex can 
be voluntarily controlled is conflicting 

20 Finally ernol tonal changes which cannot be shown to 
have somatic correlates cause a psychogalvanic deflection. It 
would seem as though, by the time all of the foregoing fac- 
tors had their influence on the psychogalvanic reflex, that the 
deflections due to the piesence of emotion would be quite 
lost and undistinguishable The psychogalvanic leflex is, of 
course, commonly spoken of as a measure of emotional reac- 
tions. The point is that during a biief peiiod in the laboratoiy 
when all external disturbances are eliminated and all physio- 
logical factors are kept constant so far as possible, galvano- 
metric deflections appear which are probably closely correlated 
with emotional changes The warnings must be repeated that 
we should be wary in comparing experimental results when 
the experimental situation vanes in any significant way, and 
that it is safest never to compare the results of two separate 
sittings. i 
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Practical Use of Physiological Measures of Emotions 

Physiological measuiement of the emotions could be employed 
in a -wide lange of situations for the purpose of determining the 
extent to which various situations arouse emotional responses 
Such mcasuies might be used in the school-room to discover the 
degree to which teachers, contests, reciting before the class, ex- 
aminations, and different methods of instruction or of control 
arouse emotional response. They might be used in the home 
to discover the influence of various situations and events They 
could be used in the theater, the motion-picture house, m in- 
dustry, in athletic contests — in fact every phase of life awaits 
testing of the arousal of emotional excitement Although one 
cannot generalize and although one certainly would not recom- 
mend that all emotional reactions be eliminated, it is true that 
a response which is accompanied by emotion is generally not the 
most satisfactory form of adjustment We look hopefully to the 
future therefore for exact knowledge of the situations which nor- 
mally arouse emotions. 

The Psychogalvanic Reflex and the Association 
Experiment 

Landis and Do Wick (56) are authority for the statement that 
Veraguth and Clotta were the first to point out the relationship 
between the psychogalvanic leflex and reaction time Jung (50), 
however, m 1907 was the first to capitalize the discovery by 
finding that the psychogalvanic reflex in connection with the re- 
sponse in the free association expeiiment was an index of an 
underlying complex or area of emotional response F Peterson 
(62) believes that the galvanometer deflection is directly propor- 
tional to the degree of emotion aroused. W. W. Smith has studied 
this relationship extensively As a result of his work he believes 
the psychogalvanic reaction is a better index of the affective 
nature of the response than is reaction time. He says (80, p. 74) 

“There can be no doubt whatever that for quantitative work 
the galvanometer deflection is a far more valuable indication than 
the leaction time It is not under voluntary control and is not 
affected to any appreciable extent by non-significant intellectual 
factors such as sometimes prolong reaction time Moreover, the 
absolute magnitude of the deflections can, in general, be magnified 
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to any extent desired and read with a corresponding degree of 
precision Still more impouant is the fact that the magnitude of 
the galvanometer deflection appears to be approximately piopoi- 
tional to the intensity of the coircsponding affective tone, however 
gieat the deflection ” 

As evidence for the latter statement Smith presents two tables 
giving the rank Older of 100 free association W'oids according to 
reaction time and according to galvanometer deflection The six 
words resulting in the highest reaction time are name, jnend, 
despise, make, sad, proud, and home, v\hile the six words giving 
the greatest galvanomctric deflection aie kiss, love, marry, divorce, 
name, and woman It is not difficult to estimate that the affective 
quality of the second group of words is greater. 

The psychogalvanic reflex promises to be of service m intei pret- 
ing the reaction experiment by indicating whether failure to re- 
spond or a long reaction time is due in the particular case more 
precisely to intellectual difficulty in intcrpieting the word or to 
a conflict of two or more innocuous stimuli, or is really accom- 
panied by emotion It might also be serviceable in the case of a 
prolonged reaction time to indicate whether or not implicit re- 
actions with an emotional coloring are arousea, though they pass 
without overt expression 

The greater number of investigators trying the psychogalvanic 
reflex on patients with mental disoiders have concluded that the 
reflex cannot be used with certainty in diagnosing mental disorder. 
The testimony of various investigators is very contradictory Some 
claim that the reflex can help m diffeientiating between hysteria 
and hysteiical simulation, oi between functional and organic 
mental disoiders, while othcis insist that the reflex is without 
power to distinguish between these cases Syz (87) found that pa- 
tients in catatonic stupor have an average electrical resistance 
twice as high as noimal persons, but such findings aic not helpful 
in diagnosing the nature of tlic difficulty, as other mental states 
also result in high electrical resistance 

Use of Physiological Measures of Emotion in the 
Detection of Deception 

Most of the work in which practical use has been made of the 
various physiological measures of emotion has been done in the 
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attempt to determine whether or not a person is lying To the psy- 
chologist this seems a rather picaynnish problem From the point 
of view of science it appears of comparatively small consequence 
that in a given situation a person has told the truth or a false- 
hood. Of more consequence is the knowledge whether a person 
has some emotional complex or some compulsive tendency or 
interest in committing a crime or other unsocial act. 

But in practical affairs there is often a strong desire to deter- 
mine the facts in a situation where deceit may be involved. In 
legal criminal procedure at present, where the whole emphasis is 
on conviction for a particular offense, there is a pressing practical 
need for some device which will give a check on the veracity of 
testimony and confessions So those who have worked in this 
field — Benussi (117) and Lombroso (127) some years ago in 
Italy, and more recently Marston, Larson, and Landis in this 
country — have approached the problem with an interest in its 
applications to police or legal procedure. The contemporary work- 
ers have held with naive enthusiasm to a belief that methods 
will soon be devised which will guarantee beyond doubt that a 
person is lying or telling the truth Marston (129), using systolic 
blood-pressure, claims an accuracy ranging from 90 to 100 per 
cent Larson (122, 123) claims over 90 per cent accuracy in his 
work with a combination of blood-pressure and breathing records. 
(How much over 90 per cent cannot be known because of dis- 
appearance of a subject, “inability to obtain a confession” ('), 
and other causes ) 

The reported experimental work, with the exception of that of 
Landis, is extremely unsatisfactory Altogether too much has 
been left depending on the individual judgment of the experi- 
menter Most of the work has been done from the point of view 
of being able to spot the individual liar rather than of systemati- 
cally investigating the relationships between the various measures 
used and telling the truth. The statistical handling of the data 
collected has been crude beyond words. The amount of deviation 
in any measure which distinguishes truth from falsehood has 
been established arbitrarily. In short, the experimental work to 
date is mainly rough pioneering. 

The theory underlying the work has been that when one tells 
the truth there is no particular arousal of the emotions, but that 
when one is deliberately concealing the truth by falsifying, there 
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is an accompanying emotional tension or strain. When telling the 
truth the tendency is toward relaxation, when lying, one is on 
guard, and there is a definite heightening of effort and tension. 
Of course the hardened criminal who is so obtuse or so prac- 
tised in his art that he can he without “batting an eye” would 
show no such emotional disturbance, but such an individual is 
comparatively rare, according to the belief of those who have 
worked in the field. Nearly every one can draw on his own ex- 
perience for testimony concerning the accompaniments of a he — 
the racing heart, the gasp in the breath, increased perspiiation, 
a dry mouth, and the like If one could be certain that these things 
always happeijed, and if they could be detected experimentally, 
then we might attain a sure-fire diagnosis of deception. 

Most of the experimental work has been carried on in the 
laboratory or college class-room with artificial “crimes” com- 
mitted by members of the class. Some work has been done in 
police courts with actual criminal suspects It is the belief of 
those who have worked in this field that the laboratory situation 
IS not really a fair trial for the method When the situation is not 
real, the emotional effects may be inhibited, but when a criminal 
suspect is really attempting to defend himself before society, the 
emotion is hard to keep down. 

Although any of the physiological measures of the emotions 
previously described might be used for this purpose, the two that 
have been most commonly tried are (a) systolic blood-pressure 
and (b) breathing. 

Marston (129) waxes enthusiastic over the possibilities of 
blood-pressure. Trying it out first in faked-up crimes in the 
laboratory and later in actual criminal cases during and after 
the World War, he achieved remarkable success In laboratory 
experiments Marston reports 96 per cent correct judgments of 
truth or falsehood on the basis of blood-pressure records, whereas 
a “jury” who listened to the questions and answers ranged from 
10 to 85 per cent correct with an average of 48 per cent Again, 
during the World War, Marston (128) reported 942 per cent 
correct judgments by himself, while privates in the United States 
Army, given the same blood-pressure records to interpret, were 
able to make correct judgments in only 74 3 per cent of the cases. 

Marston claims that there are typical blood-pressure curves 
of truth and falsehood. The curve of falsehood shows a steady 
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rise as the various questions are asked, reaching a peak at the 
crisis m the testimony The cuive of tiuth, on the other hand, 
pursues a level couise uitli few deviations. Natuially one must 
distinguish between the changes m the curve due to the presence 
of normal complexes and those due to lying. Marston believes that 
lying always produces a greater rise in the blood-pressure curve 
than any incidental emotional complex In his early woik he set 
eight millimeters’ use in the mercury level of the sphygmomano 
meter as symptomatic of lying, but later he raised this figuie 
to twelve millimeters 

Larson (123, p. 450) notes the following as indices of decep- 
tion. 

I Increase in blood-pressure. 

2. Decrease in blood piessure. 

3 Increase in height (pulse^). 

4 Inciease in fiequency 

5 Summative effects 

6. Incomplete inhibition (breathing^). 

7 Complete inhibitory effect 

8. Irregular fluctuations, especially noticeable at the base of 
each cardiac pulsation 

9 Combination of any of the above effects in the same indi- 
vidual 

10. These changes may occur with but little latent period, or 
they may be accumulative m effect and more generally dis- 
tributed 

Landis (119, 120), in collaboration with Gullette and Wiley, 
gave the systolic blood-pressure method of detecting deceit a 
careful trial and found that he was able to diagnose correctly 
only twelve out of twenty-two cases. That is only slightly better 
than chance Landis (121) concludes, “For this group and with 
this technique, blood-pressure reactions were an untrustworthy 
criterion of consciousness of deception ” But in another place 
Landis admits that his experimental conditions may account for 
his poor results and says, “We are of the opinion that the blood- 
pressure method of detection of falsehood is what Marston orig- 
inally claimed for it, ‘highly diagnostic,’ if all conditions are favor- 
able ” 

In using breathing for the detection of deception, the recent 
experimenters have gone back to a discovery made by Benussi 
that under the influerice of a strong emotion the time for inspira- 
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tion IS greater than the time for expiration When one laughs 
there is a tendency to take the breath in quickly and then let it 
out more slowly in little shoit gasps In anger or fear, on the 
other hand, one bieathes out rapidly, the inspnation is more legu- 
lar, and the breath is held after it is taken m Breathing is meas- 
ured and recorded by means of a pneumograph, as described 
earlier in this chapter Feleky (5) has determined the following 
inspiration-expiration ratios (I/E). 

Table 82 

InspibjITIon-Expiratios Rvnos ron Various Stimuli 
(from Feleky) 


Normal breathing 

0 

00 

Laughter 

30 

Hatred 

SIS 

Disgust 

I 08 

Pleasure 

I II 

Pam 

I S46 

Anger 

I 48 

Wonder 

249 

Fear ... 

266 


Benussi (117) claims practically 100 per cent certainty in using 
the inspiration-expiration ratio in the determination of deception. 
Although breathing generally is under voluntary control, the I/E 
ratio under the influence of emotion seems to be practically inde- 
pendent of intentional contiol Certain improvements in the 
method of recording the I/E ratio directly have been described by 
Burtt (12, 13). He was successful (118) in detecting the truth or 
falsehood in 73 per cent of the cases which he tested He believes, 
howevei, that blood-prcssuie is a more accurate index than the 
I/E breathing ratio, paitly because he notes that the influence of 
lying upon breathing appears to become less with habituation 
Larson claims exceptional success in using the inspiration-expira- 
tion ratio in conjunction with records of pulse and blood-pressure 
Landis (120), who with Wiley appioached the problem ex- 
perimentally and without evident bias, found that the I/E method 
gave diagnostic results in 63 per cent of the cases when subjects 
were questioned concerning the figures on a card and in 50 per 
cent of the cases when they were questioned concerning an imag- 
inary crime Landis states, “We may conclude that we are deal- 
ing with a real factor which does vary frojn truth to falsehood ” 
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It is evident that one cannot correctly diagnose deception unless 
all the conditions aic favorable. Some of these conditions are as 
follows • 

I Perfection of instruments Defective or carelessly set up in- 
struments will cause any laboratory experiment to fail, as will 
careless manipulation of the machinery Laboratory apparatus of 
the soit we are describing is not at present adapted to rough usage 

2. Reality of the situation There is evidence for believing that 
the ordinary demonstration before a psychology class of the ex- 
periment with planted and artificial crime does not produce the 
emotional shock necessary to give the method a fair trial Actual 
cases in the police court or “third degree chamber,” where the 
defendant is driven to the necessity of proving his innocence to 
avoid conviction, possess a reality which should cause emotional 
upheavals if anything can 

j. Familiarity with the procedure There is some evidence that 
the results are better when the subject is unfamiliar with the pro- 
cedure. If the subject knows that the complicated apparatus 
strapped on his arm and around his chest is measuring his heait 
action and bieathmg he may make a partially successful effort 
to control his emotions. On the other hand, if the subject is totally 
unaware of the significance of the experiment, his replies to the 
questions asked will be accompanied by all the usual emotional 
distuibance 

4. Types of questions Little is known of the emotional influ- 
ence of different types of questions This is a fertile field for 
investigation and one of the highest importance for criminal 
psychology It is not known, for instance, whether the emotional 
effect IS greater in a room with an atmosphere of excitement and 
tenseness in which there are several people present than in a quiet 
room alone with the experimenter It is not known whether it is 
more effective to plant significant questions casually between neu- 
tral questions, or to proceed with a string of direct leading ques- 
tions bearing on the crime, or to arrange the questioning so as to 
lead up to a dramatic climax It is not known whether the ques- 
tions should be put in a short space of time or whether the exami- 
nation should be strung out over a protracted interval A recent 
newspaper article describing the use of the “lie-detector” in forc- 
ing a confession made mention of continuing the inquisition over 
several days. If it was^continued too long, one might expect anger 
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to be aroused in an innocent person by mere impatience at being 
confined or insulted unnecessarily. 

5. Emotional susceptibility of the sub‘]ect It is well known 
that some persons are far more sensitive to telling a lie than others 
and that there are large individual differences in the emotional 
effect of telling a falsehood. 

There is immense popular interest in methods of detecting de- 
ception. A great deal of further careful experimental work must 
be done, however, before the method can be considered ready for 
practical use. 


Biochemical Measurements and Personality 

Rich (136, 138) has discovered certain rather surprising rela- 
tionships between the acidity of saliva and urine on the one hand 
and personality characteristics on the other hand that offer a 
promising lead in the diagnosis of personality. By making a deter- 
mination of the hydrogen-ion concentration of the saliva, he 
found a positive relation between the alkalinity of the saliva and 
ratings for excitability That is, since hydrogen-ion concentration 
decreases as a liquid becomes less acid, the most excitable indi- 
viduals have the least acid saliva and vice versa. Starr (139) earlier 
found the same relationships when working with two types of 
stammerers. 

Corresponding facts were found in testing the acidity of the 
urine, which correlates negatively with emotional excitability. The 
most excitable individuals have the least acid urine and vice versa. 

Similar relationships were found in correlating with rating for 
aggressiveness. The following table gives the findings. 

Table 83 

CoKRELATlONS OF ALKALINITY OF SaLIVA AND AciDITY OF UrINE WITH 
Personality Ratings 

(from Rich) 



39 UNDERGRADUATES 

18 GRADUATES 


Aggrts- 

Exext- 

Aggres- 



stveness 

abxltty 

siveness 

ability 

Salivary alkalinity 

+ 09 

+ 28 

+ SI 

+ 4 S 

Urine acidity 

— 31 

— 2S 

00 

-26 

Urine acidity (formal titration) . 

— 31 

— 06 

— 24 

— 27 

Urine acidity (formal titiation) 





divided bv bodv weielit 

— M 

— .12 

— 11 

— 17 
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The correlations indicate that bodily alkalinity goes with aggres- 
siveness and exeitability, whereas bodily acidity goes with passiv- 
ity and lassitude. 

Determination of Amino-Acid Nitrogen 

The waste products of metabolism contain a number of ammo- 
acids In ordinary urine these are approximately neutral m reac- 
tion If, however, formaldehyde is added, a reaction takes place 
m which the basic propeities of the ammo groups are destroyed, 
leaving carboxyl groups which are moie strongly acid These may 
be titrated against an alkaline solution and the acidity deter- 
mined. The method commonly used is known as the Henriques- 
Sorenson or Formal Titration Method. Rich added the acidity 
as determined by direct titration to the acidity as determined by 
the formal titration to give a measure of total acidity. 

The amount of acid found m a twenty-four hour specimen of 
urine has a direct relation to the weight of the individual, since 
the heavier an individual the more tissue there is which is being 
metabolized, and the larger the amounts of waste substances ex- 
creted in the urine Consequently Rich divided the urine and acid- 
ity findings in each case by body weight to eliminate what might 
prove to be a disturbing factor in his comparisons. 

The creatinine concentration of the urine was also found by 
Rich (131) to be associated with personality factors. Since crea- 
tinine IS a product of the process of normal metabolism taking 
place in the muscles which is finally eliminated through the urine, 
the creatinine output m the urine is a rough measure of the meta- 
bolic process. 

Rich found that the creatinine concentration of the urine 
correlates — 24 and — .10 with ratings of emotional excitabil- 
ity for eighteen graduate and thirty-nine undergraduate students 
respectively When the creatinine output is divided by the body 
weight (creatinine coefficient), the corresponding correlations 
with emotional excitability are — .23 and — 24 This small but 
consistent relationship indicates that here is a factor of some im- 
portance in determining personality trends. Less excitable indi- 
viduals tend to have a more rapid creatinine metabolism and vice 
versa. 

The method usually employed for determining the creatinine in 
urine is that developed by Folin and is called Folin’s Colon- 
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metric Method * This method is based on the characteristic prop- 
erty of cieatinmc, that of yielding a certain definite color-reac- 
tion in the piesencc of picric acid in alkaline solution. 

Each of these measures of metabolism varies according to cer- 
tain factors. 

The acid content of the saliva vanes according to the bacterial 
activity between the crevices of the teeth, which tends to make 
saliva acid, and according to other obscure factors related to 
vietabolisvi. 

The acidity of the urine vanes according to the influence of a 
number of factors, among them being’ 

I Diet Since diet may be one of the factors which determine 
excitability or aggressiveness, as well as the character of the 
urine, Rich made no effort to control diet, but let his subjects eat 
as they usually did His only restriction was to omit certain 
articles of food, such as liver, that have a rather marked influ- 
ence on the composition of the urine and would cause variations 
outside of the normal range. 

2, Water intake has a decided influence on the concentration 
and constitution of the urine. 

5. Musculai activity influences the acidity of the wine. 

4 The acid products in the urine are a function of bodily 
weight The heavier a man, the more muscular tissue there is 
to be metabolized 

5 Acidity of urine tends to be increased by certain pathological 
disordeis 

6 The acidity of the urine is a function of the time of day 
when It IS passed For some time after a meal the urine may even 
give an alkaline reaction This phenomenon is called the alkaline 
tide and is due to the chemical process of digestion. 

7. Urine may become alkaline, or even more acid, upon stand- 
ing for some time, due either to bacterial decomposition or to 
fermentation 

The creatinine concentration of the urine is said not to be af- 
fected by the amount of muscular activity over twenty-four-hour 
periods. 

j. Creatinine output is dependent in part on the nature of the 
food taken. 

*A description of the procedure may be found on page 732 of Hawk and 
Bergcim, Practical Physiological Chemistry, ninth edition 
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2. Creatvnvne output is a direct function of the metabolic process 
of the muscles in such a way that the muscular efiiciency of an 
individual depends upon the intensity of the process 

3 Creatinine output changes under pathological conditions It 
is said to increase in typhoid fever, typhus, tetanus, and pneu- 
monia and deciease in anemia, chlorosis, paialysis, muscular 
atrophy, degeneration of the kidneys, and leukemia 

Rich also determined the alkalinity and creatinine content of 
the blood, but since these methods should be undertaken only by 
one specially trained in blood-letting, they will not be described 
here. The same relationships, however, apply to the character of 
the blood that were found to be true for the saliva and urine 

One can only speculate as to the possible causes of the relation- 
ship between these physiological factors and personality factors. 
It is known, for instance, that glandular secretion is rendered easy 
or difficult by a change in the acidity of the blood An increase 
in the acid character of the blood tends to retard glandular se- 
cretion If these personality differences are due to extent of the 
secretion of the adrenal or pituitary or other glands of internal 
secretion, and the activity of these glands is directly related to 
the acidity of the bodily tissues, then the explanation of Rich’s 
findings IS clear But this is only a hypothesis. 

Again the creatinine excretion may be a measure of glandular 
activity If glandular pioducts act as catalyzers in hastening or 
retarding metabolism, this fact would be shown by the products of 
metabolic activity. Consequently if emotional excitability is also 
a function of glandular activity, it may be measured in this round- 
about way by the products of metabolism. These again are only 
speculations. 

Chemical determinations of personality are certain to loom 
importantly in future work, which may uncover important facts 
bearing on the causes of personality differences 
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Chapter XII 
INTERVIEWING 

I NTERVIEWING is a method of securing data by a face-to- 
face consultation or conference in which a person tells an 
interviewer his story or version of a set of facts ot ansueis 
questions concerning whatever problem or inquiry is at hand The 
interview may employ obseivation and lating as techniques, and 
it bears close resemblance to the questionnaire in many respects 
While in one sense interviewing has greater flexibility than an)"^ 
one of these methods, in that no schedule need be ngidly adhered 
to, in another way it lacks flexibility, in that the mental processes 
of another person are often intricate and intractable and attain a 
certain momentum, making it difficult to hold the conversation 
to the mam topic Since in the past some of our more exact testing 
methods have been developed fiom the mteiview, theie is e\ery- 
reason to believe that interviewing is now in piocess of elaborating 
through oxpeiicnce more acciiiate methods for investigating cei- 
tain phases of conduct than are now available In the inteivicw the 
measurement point of new is lacking, and the interviewer must 
make an inteipiotation fiom running narrative or isolated answeis 
to questions Though this is the oidinary common-sense way of 
sizing up a situation, it can hardly be thought of as an exact 
method 

Types of Interviews 

Three mam types of interviews should be distinguished. The 
dtagnosttc interview seeks to discover facts concerning the life 
history of the mfoimant and to extract from his nariative his 
opinions, attitudes, and personal experiences In the diagnostic 
intervierv interest is concentrated on the individual being inter- 
viewed In the rpt/’/irrh in^frvipw interest does not lie in the person 
being intcrviewedTexcept as he can contribute data helping to solve 
a particular problem A good example of this type of interview is 
furnished by the work of Charters (17) m making job analyses, 
as for example, whenjsecretanes were interviewed to discover the 
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operations which they performed, with attention centered on the 
jobs and not on the person being mlei viewed The thud tj'pe of 
interview inclu des ti eat me Hi The social case woikei is iiUcrested 
not only in obtaining facts regaiding her client, she ivishes also to 
persuade him to change somewhat his family situation 01 other 
adjustment Or the college dean hopes to elTect through the inter- 
view some improvement in the standing of the student befoie him 
The diagnostic and lieatinent intciview arc often inteiminglcd, 
and it is difficult to tell wheic one begins and the othei leaves off 
In the present discussion our concern is strictly with the diagnostic 
phases of the interview, and the inter\ lewei’s endeavors to influ- 
ence his client must be ignored 

Another difficulty in discussing the interview is that interview’s 
conducted by diffeient peisons diffei m purpose and m technique 
Inteiviews aie conducted by physicians, lawyers, priests, journal- 
ists, detectives, social woikeis, psychiatrists, psychoanalysts, 
deans, research workers, employment managers, anthiopologists, 
sociologists, and othei s The physician wishes a cleai and accuiate 
statement of disturbing symptoms, the priest wants a confession, 
the newspaper repoiter wants a racy story or an unusual and 
significant comment, the social woiker is anxious to discover social 
relationships, employment managers aie on the lookout for the 
aptitudes and character traits that make a man a valuable em- 
ployee The sociologist is particularly interested m the environ- 
ment, whereas the psychiatrist allows his interest to become ab- 
soibed by the mental mechanisms uncovered Each puipose re- 
quites Its own technique, and it is difficult to describe them 
together under the single heading “intei viewing ” 

Advantages and Disadvantages of the Interview 

The interview has often been contrasted with the questionnaire. 
Each resembles the other In both methods questions are asked 
and the answers are to be taken at their face value as facts, or 
used to show impoitant trends in the individual 

Charters, who has made effective use of the interview in de- 
termining curriculum objectives, says of this method in his 
Curriculum Construction* (19, p 134) 

• From Charters, W VV , Curriculum Construction By permission of The 
Macmillan Company, publishers;- 
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“In all cases the oral questionnaire is preferable to the written 
form. This means that when the teacher becomes an interviewer 
and asks the questions orally, he will obtain more reliable answers. 
He can clear up misconceptions of his meaning, and can supple- 
ment his questions by others which will elicit more definite an- 
swers. The chief objection to the oral questionnaire is the labor 
required of the interviewer; but this is amply compensated for by 
the greater trustworthiness of the information, the comparative 
ease with which the answerer is able to provide it, and the greater 
amount of time he will spend upon it. Answerers who dislike to 
write, and would spend very little time on written answers, are 
glad to devote considerably more time to an oral interview ” 

Whether or not the examiner will follow up leads during an 
interview, in order to obtain greater explicitness than is possible 
in the questionnaire, depends on his versatility and his desire to 
complete an adequate piece of investigation Meltzer (39) claims 
that the interview yields more meaningful and revealing responses 
[than other methods of inquiry. He lists as advantages of the inter- 
I view method that the responses are more natural, less artificial, 

I more informal, and more spontaneous. The interview gives the 
I examiner opportunity to investigate further generalities, obscure 
points, or evasions It permits the examiner not only to collect 
facts, opinions, attitudes, likes, and dislikes, but to find out the 
“why” of these responses Cavan points out that through the 
- interview it is possible to trace development more effectively than 
"with the questionnaire 

But the interview has its disadvantages Suggestion has as much 
force in the oral interview as in the questionnaire, if not more. 
Witness responses to a doctor’s request for symptoms If you are 
an “ailing” sort of person, you enlarge upon insignificant symp- 
toms; if you are a stoic, you conceal what may be important 
symptoms Likewise in any interview a person's judgment will be 
pulled or warped not only by the questions and their form, but by 
expression, intonation, gesture, and the like. “You only study a 
half-hour a day^” said with an air of surprise brings the defense 
response, “Why, usually I spend more time than that, but some- 
times I can get it m half an hour, and then I have my music to 
practice, you know ” 

I The interview, in addition, is much more expensive in time and 
I cannot reach as large a group of informants as the questionnaire 
with an equal expend.ture of time and effort. 
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/ Finally, because the interview as free response cannot be 
/ “scored,” it cannot be quantitatively treated and interpreted This 
denies to the interview amenability to accurate interpretation that 
can be accorded the data from a carefully assembled questionnaire 
Davis (23) reports a comparison of the questionnaire and inter- 
view methods by McCall that has suggestive implications The 
replies of fifty women interviewed as to their sex life were com- 
pared with the questionnaire, replies of fifty women of as nearly 
equal age, education, and mental status as possible Unfortunately 
the differences reported in this study were not treated statistically 
for reliability, and there lurks a suspicion that many of them 
result from mere chance The main findings are as follows 
'l. The questionnaire method is more accurate, judged on the 
basis of eliciting a larger number of admissions of sex practices 
among the group Since these practices are socially disapproved, 
we are tempted to generalize from this finding that an anonymous 
questionnaire is more reliable than an interview in revealing dis- 
approved practices 

^2. The interview method is more reliable in that it yields fewer 
inconsistent replies. This may be due to ill-defined questions in the 
questionnaire or to the fact that, if necessary, questions in the in- 
terview can be elucidated further. 

j. The interview method did not produce appreciably more an- 
swers with definite information (1. e, leplies with entire frankness 
and candidness). 

'4. Slightly more information relative to childhood impressions 
and feelings was secured by the questionnaire 
5. While anonymity of the questionnaire method may con- 
tribute a distinct advantage over the inteiview method if a follow- 
up of individual cases is desired, the very anonymity of the former 
method renders a follow-up investigation impossible, except of 
course where additional general questions are addressed to the 
wMe group. 

'6. The interview method secured 34 per cent replies of those 
solicited against 32 2 per cent for the questionnaire — a negligible 
difference. 

^7. That the interviewer contributes anything through his judg- 
ment of the personality of the subject or through detection of dis- 
honesty is doubtful 

When we consider the greater ease with vjjhich the questionnaire 
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method may be employed and that it may be depended upon to 
yield fully as complete and arrurale and frank results as the inter- 
view, the results of this study seem to indicate that the ques- 
tionnaire method is pieferable. 

Standardization of the Interview 

The matter of standardization is a point of issue concerning the 
interview and one that has dnided interviewers into two camps 
On the one hand there aie those who plead for more flexibility 
in the interview The inteiviewer is admonished to adapt the 
interview to the individual, so far as possible becoming all things 
to all people He is told when conducting an inteiview, to “do in 
Rome as the Romans do” and to adopt the language, mannerisms, 
dress, and attitudes of those he is questioning He is told to follow 
no fixed schedule of questions but to endeavor to get his subject 
to conveise naturally and to use questions only to follow up prom- 
ising leads He should analyze the story as it proceeds and be 
versatile in redirecting the thread of thought wheievcr clues are 
most promising The object of this type of inteiview is to draw 
the informant out and obtain an unconstrained account of his 
prej'udices, motives, drnes, passions, etc 

On the other hand there are those who advise planning the 
interview carefully ahead of tune The questions should be framed, 
printed in a schedule, perhaps rehearsed Hamilton (27), in his 
study of maiiiage iclalions, was so fearful lest the intonation of 
his voice should suggest an answer, that he prepared all of his 
questions on typewritten cards and handed them to his subjects 
one at a time The responses tvere as free and untrammeled as 
possible Others have gone still further and have not only pre- 
pared the questions carefully beforehand, but have attempted to 
standardize the selection and formulation of the questions much 
as IS done in test construction Snedden has prepared a disguised 
intelligence test in the form of an interview, while O’Rourke (46) 
has prepared problems in interview form for testing the judgment 
of applicants for the position of prohibition agents. Used in this 
way the interview approaches the questionnaire or test in degree 
of standardization. 

Those who would make the interview flexible are keenly aware 
of the difficulty of getting persons to express themselves freely. 
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Those who would standardize the interview are most conscious of 
the VICIOUS influences of suggestion in biasing the results, of the 
danger of laying too gieat stress on unprofitable clues 01 the 
answers to insignificant questions Future developments aie certain 
to lead to increased experimentation with methods of standardiz- 
ing the interview. Since our experience with tests has shown how 
extremely important to the results the manifold variable factors 
in the situation arc, it is imperative in the interests of increased 
accuracy in the lesults of the interview that there should be a 
more thorough try-out of uniform methods On the other hand, we 
should avail ourselves of every device and technique for enlisting 
the rapport of the subject, for breaking down resistance, and for 
encouraging free expression 


The Interviewer 

Qualifications. Since the inteiview is so subjective a process and 
one in which personal qualities play so large a part, the personality 
of the interviewer is an important factor. In the discussion of 
tests, questionnaires, etc , our mam concern was with the accuracy 
of the data In the interview the main consideration seems to be 
getting any data at all An interviewei is rated primarily for his 
success in drawing subjects out and getting them to talk freely 
about themselves. 

The interviewer should first of all be thoroughly familiar with 
the field of investigation If he is a school counselor 01 college 
dean, the interviewer should know all about the institution — its 
curriculum, its traditions, and its clientele If he is a vocational 
counselor, the interviewei should knou all about industrial and 
business opportunities, Avorkmg conditions, type of job available, 
etc If he IS a social woikei, he should have adequate knowledge 
of the social environment of the home — the churches, schools, 
clubs, stores, and places of amusement 

Charters (18, p. 285) wants his interviewers to be “logical- 
minded.” What does this mean except that they should be gen- 
erally intelligent and keen-witted to sense the forces at play in the 
life of the person whose story is being unfolded^ Charters also 
wants his interviewers able to “dig in,” by which he means able to 
follow up promising clues to elaborate some point on which the 
subject fails to go into detail. • 
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Gnffitts {26) speaks of a knowledge of psychology as being 
essential to the equipment of the interviewer Much depends 
on the psychologj'^, however, and certain forms of academic psy- 
chology are little more than useless A detailed knowledge of 
mental mechanisms, howevei — the rationalizations, attitudes, and 
motives that guide men’s behavior — should be of aid in inter- 
pretation 

The interviewer should be well adjusted himself. The psychia- 
trists tell us that one cannot be a successful interpreter of the 
mental mechanisms of others until he has seen fiist to his own 
adjustments The interviewer should be poised, emotionally stable, 
not prone to show undue excitement, and he should understand 
himself. Some say that one must actually have lived through 
various forms of experience before he is able to cleaily recognize 
and diagnose them m others 

The interviewer should command the respect of his informant 
without awing him His position should be one of prestige in 
the eyes of his subject, but he himself should be friendly and 
cordial His personal appearance must be acceptable There must 
be notmng in the appearance or approach of the mterviewei 
that will tend to offend or disgust the other person He should 
be plainly, neatly dressed, not conspicuously nor untidily. He 
should be as educated and as refined as those with whom he is 
to deal, but if possible not more so On some occasions he 
should show considerable savoir jaire, on other occasions he 
should adopt the language, gestures, dress, and bearing of the 
person he is to interview 

Since) ity and sympathetic understanding are important char- 
acteristics for the interviewer to possess He must be genuinely 
interested in the problems and troubles of other persons, and 
be ready to listen attentively to a long-drawn-out tale of woe 
He should be able to see the other’s viewpoint and to share 
the other’s hopes and fears whole-heartedly, without a tendency 
to ridicule or to criticize them Kindness and tolerance rather than 
coldness, austerity, and superciliousness should characterize the 
interviewer He should neither approve nor condemn the errors 
and lapses of his subject nor should he exhibit surprise nor 
maudlin sympathy, but genuine whole-hearted understanding. He 
should not betray the slightest trace of surprise or shock at the 
disclosures made to him, but receive everything in an impersonaL 
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detached, unsentimental manner The person interviewed must 
be convinced that he is talking to a friend who will not betray 
him and who is primarily mteiested in his welfare 

The interviewer must be courteous and respectful, displaying 
the deference which is appropriate to the occasion. Seldom should 
he contradict or dispute another’s point of view, except where 
this is done purposely to make a man take sides and express 
himself vigorously The intei viewer should be a person of vigor, 
one who will command the attention of the informant He should 
have the warmth and heartiness about him that breeds a sense 
of confidence and security 

Finally, the interviewer should have a sense of humor Often 
in an interview tensions and inhibitions will arise which block the 
free development of the story The interviewer must be on the 
watch for these emotional blocks, and stand ready to ward them 
off A sense of humor sometimes will help one in smoothing over 
a situation which is developing uncomfortably 

Training. Although many of these characteristics are so fun- 
damental that the interviewer must be selected for them rather 
than trained in them, there are certain matters of technique m 
which training should be of distinct advantage Schools for train- 
ing psychiatric social workers give definite instruction in the art 
of interviewing There are definite techniques in making the ap- 
proach, gaining rapport, and breaking resistance The groups 
which believe in a more complete standardization of the interview 
process gives instruction in the precise choice and formulation 
of questions, in the recording of answers, and in the interpreta- 
tion of the answers. Accurate studies of the interview technique 
have yet to be made, but we may suimise that training the inter- 
viewer will enhance greatly the reliability and significance of the 
interview. 


Voluntary or Involuntary Interviewing 

Should the interview be voluntary, or may it sometimes be 
involuntary.^ All depends on the situation The applicant for a 
job, the confessor before the priest, and the sick man before 
the physician seek the interview On the other hand, testimony 
in a court of law is often decidedly involuntary, as when the 
witness is activated not by the primary wish to answer any ques- 
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tion that may be put to him, but by a desire to clear himself or 
others of accusation In school the interview is sometimes op- 
tional, as when a student seeks advice or guidance, at other 
times It IS involuntary, as when a boy is sent to the principal’s 
office for misbehavior 

When the interview is voluntary and initial resistance is lack- 
ing, one may expect the answers to be on the whole truthful and 
complete Even in the voluntary interview, however, the subject 
may either intentionally or unintentionally withhold some of his 
story, or color it so as to show himself in the most favorable 
light But It IS in the involuntary interview that all of the arts 
of the interviewer must be used At every step some sort of 
appeal to the self-interest of the subject must be employed To 
persuade him to consent to the interview in the first place he 
must be promised that it will help him attain the position, suc- 
cess, or strength that he craves, or understand himself better, or 
avoid failure or incompetence Sometimes in the research type 
of interview it is possible to obtain cooperation by referring to the 
subject’s position of prominence or by remarking about his expert- 
ness. “We are canvassing the opinions of some of the outstanding 
men in the freshmen class” is almost certain to break down re- 
sistance and reserve If the name of some person or institution 
can be used to add authoiity to the interview, this may help 
to overcome initial resistance The old trick of the book agent 
who sells the fiist book in a tovvm to the minister of the local 
church need not be employed, but any just cause for an interview 
can usually acquire institutional backing 


Preparation for the Interview 

The preview. As a preparation for the interview one should 
try to find out everything possible about the subject A news- 
paper reporter interviewing a celebrity should have read his bio- 
graphical sketch in Who’s Who and should know all about his 
recent exploits The school counselor before interviewing a pupil 
should consult the school records and be familiar with such facts 
as his age, grade, teachers, marks, participation in extra-cur- 
ricular activities, intelligence test score, physical examination 
record, and whatever else the recorded evidence may yield. A 
knowledge of his reputation with his various teachers, his out- 
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standing behavior characteristics, hobbies, interests, and note- 
worthy achievements would be ol help The social ^^olker should 
be thoroughly convcisant with the past history of his client’s 
case, from the occasion of oiiginal icfcrence thiough all suc- 
ceeding intei views and recommendations Armed with such pre- 
liminary information, the interviewer will be in a position to 
make an effective opening, to gain rapport, and to procure the 
desired additional data most readily 

Making initial contacts and appointments. Gray and Monroe 
(25), m interviewing adults on their reading interests and habits, 
paid particular attention to establishing initial contacts A letter 
was sent to make the first contact and this was followed by a 
telephone call to make the appointment A definite time for the 
interview should be set and this time should be rigidly adhered 
to by the interviewer. In cases where the initial appointment 
cannot be arranged by telephone, a letter of introduction may be 
used. 

Conditions of the interview. The time and place of the inter- 
view are important factors in deteimming its success Ample time 
should be provided for an interview, since the appearance of 
leisure to talk over problems to any desired length is essential 
Those who see the interview as an informal way of getting facts 
recommend that the interview be held wheiever and whenever 
It IS most convenient to the subject The reporter attempts to 
catch his subject on leaving the boat or boarding his train or 
m his hotel Time and place are made to fit the convenience of 
the person being mteiviewed The social worker finds that it is 
sometimes helpful to hold the interview m the home, not only 
for convenience to the subject, but also because one then 
gets a clear picture of the enviionmental backgiound of the 
case. 

On the other hand, strong arguments have been proposed for 
the office interview In the office the pnvacy and jieedom from 
distracting influences can be had which it is usually not possible 
to get at home For the type of interview where inhibitions must 
be broken down, an atmosphere of security and time to talk 
over problems is requisite The interviewer must breed assurance 
that he is the sort of person in whom one may confide in 
safety, and that he has a genuine interest in the welfare of his 
subject. Balanced against the privacy of jhe office is the sense 
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of strangeness which in some cases may become an inhibiting 
influence 

If one IS to hold interviews m his office, some attention should 
be paid to its atmosphere The rvaiting room should be pleasant 
and comfortable, with a living-ioom atmosphere that is at the 
same time professional Comfoitable chairs and interesting maga- 
zines must be provided The office itself should be neat and busi- 
nesslike, the fewer distracting objects about the better Probably 
in the interests of privacy, secretaries or stenographers should 
be out of sight. Subjects should be made as comfortable as 
possible, perhaps even reclining so that there may be comfortable 
relaxation. 

In general we may ctjnclude that if we are primarily interested 
in the sociological background of the subject, he should be inter- 
viewed at home or in some other characteristic haunt, but if 
the purpose of the interview is to get the subject to disclose his 
own attitudes and problems, perhaps the office with its privacy 
and security is the best place. 

Steps and Procedures in Interviewing 

Those who have analyzed interviews find that they shape up 
much like any other story In form they bear resemblance to the 
novel or the drama Four separate stages have been recognized 
the introduction, rising action, denouement, and conclusion. 
Sometimes one or another section is lengthened out or telescoped, 
but they all appear unmistakably. Each step of the interview 
has Its own techniques, and a discussion of them will illustrate 
the procedure 

Introduction. The introduction may be brief or long according 
to circumstances If the interview is voluntary and the problem 
clear-cut, it may proceed at once to the point at issue. Usually, 
however, there must be a brief period of getting acquainted The 
proffer of a cigarette, an invitation to lunch, a ride in the car 
may serve to break down reserve In the office interview the 
interviewer may meet his client with a hearty handshake The 
usual bromidic phrases about the weather, or remarks about 
the latest news or rumors always serve as conversation starters. 

Often one needs at the outset to describe briefly but honestly 
the purpose of the interview. Students may come to a personal 
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conference with a counselor, holding an entirely wrong notion 
of its purpose The first requisite to put the student at case, allay 
apprehension, or bieak down diffidence is a simple statement of 
the purpose of the interview At this time one should stress the 
advantages that can come to the inteiviewee fiom the confer- 
ence, or, if it IS a research interview', the contiibution he can 
make to the study under way The subject must be persuaded 
that the interview is to be worth w'hile, that the interviewer is 
competent, and that he can handle the problem professionally. 
The interviewer should do everj'thing possible to relieve the 
subject of fear or embarrassment and to assure him of his own 
genuine desire to be helpful and his intention to give a square 
deal. 

In the case study interview, where one wishes to cover the 
topics of a prepared schedule, it is wise to open the conversation 
on some topic or hobby of special inteiest to the person inter- 
viewed. Most persons are quite ready to talk about themselves. 
Maulsby (38) suggests that such questions as, “When are you 
going to get married^” or “When are you going to set up busi- 
ness for yourself^” never fail to unleash the tongues of the per- 
son questioned because they tap the deepest aspirations and 
longings of most persons Similarly for school-children, “Are 
you going to camp this summer’” or “Who is going to win the 
game this week'” or “What kind of marks are you getting this 
term?” break down shyness The latter is a valuable lead, espe- 
cially if the marks have in the past been good In general the 
interview should open on topics dealing with successes and tri- 
umphs rather than on those connected with failure 

Sturtevant and Hayes (63) suggest that the first question in 
an interview or any point involving personal difficulty should 
relate to another individual lather than to the person himself. 
For instance they advise asking, “Docs your father scold you 
about spending money rather than “Do you spend more money 
than you ought This flank attack avoids the direct semblance 
of criticism, and helps to steer the subject away from rationali- 
zation. 

Securin g rappor t. Often, if there is suspicion or hostility on 
the part ot the subject, definite techniques must be employed 
to disarm him, gain his confidence, and secure rapport One must 
bring himself down to the subject’s level, use slang or colloquial 
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language where possible But one must not “talk down” to a 
child. The intertiewer should convince the subject that he is an 
insider, worthy of confidence, lather than an outsider This means 
that the interviewer must have had considerable contact with 
the social group represented by the subject 

If possible the interviewer should tie himself up in some way 
with the subj’ect’s past experience A reference to common friends 
often makes an excellent bridge with which to span the gap 
The discovery that both have lived in the same village, visited 
the same city, taken the same trip, enjoyed the same show, or 
attended the same school will serve as an effective means of 
gaining rapport 

If possible the subject should be made to feel that he is leading 
the interview This can be done by giving him free rein in ex- 
pression and asking questions which follow up leads already 
opened 

Another suggestion by Biisley (10) is that something should 
be done to relieve the timidity or tension or embarrassment at 
the beginning The social tvorkcr may offer some immediate 
measure of relief for the present need, such as new clothes for the 
children, or food for a short time The school counselor may 
promise to intercede with a teacher 01 the school puncipal Once 
the tension has been removed from the immediate situation, the 
subject IS freed for a more rational appiaisal of the total situa- 
tion. 

Rising action. When by means of the introduction a satisfac- 
tory degice of rapport has been established, one should come di- 
rectly to the point at issue At this point a question of technique 
arises as to whether it is bettei to allow the subject to tell his 
narrative with as little coaching or prodding fiom the interviewer 
as possible, or to begin by asking a senes of questions Ex- 
perimental work by Cady (15) and earlier uoikers has demon- 
strated that more ideas are expressed, and expressed with less 
error, when the free nanative comes first and is follotved by 
more detailed, questions than by any other combination Conse- 
quently It IS best not to grill, coerce, give advice, or show au- 
thority at the opening of the interview, but to wait until the 
subject is ready to tell his own story The interview is not a cross- 
examination, but a matter of cooperation. 

A kindred issue upon which authorities differ concerns broach- 
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ing the immediate problems first or letting them wait until some- 
thing of the histoiy of the subject has been revealed Mowrer 
believes that the immediate problems should not be discussed 
until the subject has had an opportunity to tell his life story, or 
story in general, a method which postpones coming to the dis- 
agreeable issues until a feeling of confidence and familiarity has 
been established On the other hand, under certain conditions it 
may be well to come to the point at issue directly so that testi- 
mony from the life history can bear upon It 

It IS unanimously agreed that the subject should be allowed 
to express himself freely, and even to ramble if need be, at ^he 
opening of the intciview Most persons in trouble have a certain 
amount of pent-up emotional irritation that has to be discharged 
before they can think calmly and rationally on their problems. 
Let them blow off steam and tell about the injustices that they 
have been subj'ected to, the accidents and misfortunes that are 
theirs, the jealousies, spites, and revenges that they are har- 
boring. During this stage of the interview the interviewer must 
show much patience and forbearance He must make every effort 
to really see his subj'ect’s point of view and to agree with him 
so far as possible at every stage. To contradict, to argue, to 
dispute at this point would be fatal The subject should be al- 
lowed to “get It off his chest” without interruptions At this stage 
plenty of time should be allowed 

Certain definite techniques for encouraging free expression have 
been analyzed by Palmer (47) First of all are the introductory 
remarks or questions which indicate a genuine interest on the 
part of the interviewer Then there are the gestures, nods of the 
head, smiles, encouraging remarks such as “Uh-huh,” “Isn’t that 
interesting?” or “Isn’t that strange?” Facial expressions which 
reflect sympathy with the emotions in the unfolding tale help the 
interviewee to warm up to his story Nearly every one responds 
to an appreciative audience The reticent individual who finds 
it difiicult to disclose his personal secrets should not be told to 
“go on” but should be asked, “Is there anything else?” or “What 

about ?” Definite leads into the unknown are more potent 

than prods from behind 

Nothing IS more impoitant in the interviewer’s technique than 
being open and frank himself There should be no occasion for 
an interviewer saying “I can’t take that up with you j'ust now” 
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or using some device for turning the topic The interviewer 
cannot expect Ins subject to be open and frank unless he is so 
himself If h's genume wish is to help his client, there should 
be nothing to be hidden or withheld The person being inter- 
viewed should have access to e% erything that he desires to know 
which wi’I help him understand the situation The interviewer 
should paiticularh atoid being put on the defensive himself, as 
this would defeat the whole purpose of the interview. 

Mot '.at' on One who undertakes interviewing must have a 
thorough knowledge of human motu'ation and skill in adapting 
this kno.v'cdge to parlicu'ar situations He must, of course, know 
in general what tj'pes of appeals are effective, but since not all 
persons respond to the same appeals, he must know also wdiat 
these different appeals are usually associated with It all comes 
down finally to a matter of knowing the individual concerned 
and w'hat forces he is responsive to 

As an illustration, take the appeal to prejudice Some preju- 
dices aie nearly univeisal, but others, moie specific, aie 1 elated 
to race, nationality, religion, sex, occupational gioup, economic 
level, or political party The interviewer, to use prejudice as a 
motivating force, must know the common prejudices, what they 
are associated with, and W'hcthcr these factors apply to the 
individual being questioned The appeal which will arouse the 
prejudices of a labor-union member will have just the opposite 
effect on a w'ealthy mcmbei of the Republican party 

Appeals may be made to ambition, piide, ideals, weaknesses, 
foibles, scruples, desires, tastes, esthetic sense, sentiment, sense 
of humor, sense of justice, and alliuibm, and these are but a 
few of the countless factois which at one time or anothei niav 
be Tised foi motivation 'I lie appeal will be by that much moic 
effective, howcvci, the moie sincere and genuine the basis on 
w'hich It IS made 

Techniques jot lessening tension and meeting resistance. Ran- 
nells (53) has analyzed the situations which cause tension and 
resistance on the part of the person being interviewed She lists 
five groups of causes (a) Causes arising from environmental 
feelings. The setting or situation has already been referred to as 
a factor in making for free expression or leading to inhibition 
(b) Causes arising from differences between the social worker 
and the interviewee, (c) Causes arising from the interviewee’s 
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jntellectual or emotional reaction to the particular subject or 
situation under consideiation (d) Causes arising fiom the habit- 
ual reaction pattern of the interviewee Rademachcr (52) has 
given a very penetiatmg analysis of some of these habitual re- 
action patterns His whole discussion should be lead, but the fol- 
lowing quotation illustrates his mam thesis (52, pp 82, 83): 

. . . “in cases where a child is spontaneous, enthusiastic, and 
talkative, analysis of the material shows, after all, very little 
of an actual conflict situation levcaled verbally Often the very 
verbalism of the patient, while it makes for interesting and lively 
stafF-discussion material, defeats the psychiatrist’s attempt to un- 
earth a discussion of subjective attitudes There are many rea- 
sons why this IS so Of impoitance is the fact that usually such 
material is painful for the patient to discuss Only certain types 
of children derive pleasure fiom a discussion of their inade- 
quacies or inferiorities, and in such cases these deficiencies are 
not important sources of conflict Again, these verbalistic chil- 
dren are so simply because they have learned the value of this 
method in gaming their own ends They have discovered the 
value of verbalizing or becoming ciicumstantial as a method of 
evading unpleasant situations or of gett.ng them out of as- 
suming responsibility or putting forth effort toward any ac- 
complishment, In other children veibalism is the essence of the 
conflict situation itself, m that these children often find adjust- 
ment to childien of their own age difficult because of their close 
contact with their parents and their parents’ fiiends The child 
knows how to impress adults and attempts to do this with the 
psychiatrist in his effort to avoid discussion of the unpleasant 
relationships that actually are his with other childien 

“There are many reasons why only a minimal amount of actual 
subjective feeling is obtained fiom a child in an initial contact. 
A child who IS afraid in new situations is from the start under 
an emotional strain which does not allow him to converse freely 
His responses become stereotyped and his attention inadequate. 
A child who is apprehensive as a result of some recent misde- 
meanor will also react to that fear Usually he will attempt to 
rationalize his conduct, to show his innocence in the matter, or to 
place the blame elsewhere He will spend much time attempting 
to impress the examiner with his good intentions and his high 
ideals, and will not face issues that involve his personal respon- 
sibility. The child who comes from a home in which discipline 
has been repressive is often so suppressed that he is actually 
afraid to express his feelings on any subject Often, too, such 
a child, as a result of his suppression, has no feeling of secuiity 
in any of his relationships The home, which may be at the 
basis of his difficulty, offers the only security that he has, so that 
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he will not readily commit himself in any way that may affect 
this tie Children who have been accustomed to overprotective 
attitudes in the home have difficulty in facing situations that 
require personal effort As a result, then behavior becomes of a 
compensatory or extravagant nature with the aim of bringing 
attention to them easily. Otherwise, because of a personality 
molded by oversolicitous care, they are ridiculed by other chil- 
dren They do not like to face the idea that they are not well 
suited to meet outside responsibilities or that they are dependent 
on their parents At the same time, this very situation is an agree- 
able one to them and naturally they do not wish to give it up 
The responses of these children aie often what might be termed 
‘the proper thing to say’ rather than an expression of their actual 
feelings Problem children from homes m which discipline is in- 
consistent and highly tinged with emotion realize that notwith- 
standing the nagging, scolding or whippings they occasionally re- 
ceive, such a situation works to their advantage in the sense 
that they are able to have then own way Therefore, their re- 
sponses are quite likely to be conventional or stereotyped ones.” 

(e) The fifth group mentioned by Rannells consists of causes 
arising from the uncertainty of response (53, p 92): 

“Fear of disfavor in the eyes of the social worker or the com- 
munity, the danger of ostracism from a particular social, re- 
ligious, or professional group if the adherence to certain non- 
conformists’ views is discovered, the appieliension of inadvertently 
betraying another’s confidence, the desire to avoid legal entangle- 
ments which might result from the disclosure of certain facts — 
all of these may act as an impediment to the setting up of a 
natural and friendly relationship.” 

Definite techniques are available for lessening tensions and 
meeting resistance One device is simulated agreement The writer 
discovered how effective this device could be in dealing with a 
certain student when a point of disagreement had been under 
discussion and each held to his point of view stubbornly Finally 
I found an error in my position and yielded To my surprise the 
student immediately yielded not only the main point at issue but 
several other points which he had consistently maintained My 
confession of error broke down his resistance at many points. 

I A second device is that of minimizing the seriousness of the 
/client’s position This can be done in such various ways as quot- 
I mg the many favorable factors, the assets of the situation, or 
mentioning other people who are having the same difficulties, 
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or citing statistics to show the relative unimportance of the 
point at issue Frequently persons get upset because the un- 
favorable factors m the situation loom so large that they are 
seen quite out of proper perspective. To recall to them the in- 
significance of the things that give them fear and worry as 
compared with the neglected favorable factors in the situation is 
often reassuring But enlightenment of this kind must be con- 
crete and simple enough to influence comprehension Most per- 
sons can be influenced more effectively through anecdotes than 
through statistics, even though the anecdote really proves nothing. 

Another device is that of analyzing a general statement into 
specific parts The school pupil may believe himself to be a 
failure when analysis shows that his difficulties lie in only one 
subject. Quite frequently a specific experience is made the basis 
of a generalization which is then interpreted as rendering the 
whole situation irretrievable or dismal 

The converse technique is to introduce a general statement to 
displace attention upon specific details The boy who resents 
punishment for a misdeed should be told that his teacher is fair 
and just, or the football player who misses the punt needs to 
be reassured that he rates as a star player 

An entirely different method for lessening tension employs 
physical contact between interviewer and his subject The hand 
on the shoulder breeds confidence, or the touch of a hand fur- 
nishes j'ust that slight suggestion of intimacy that conveys as- 
surance and allays suspicion 

Other devices for meeting resistance have been alluded to be- 
fore The skilled interviewer betrays no shock or surpiise at any 
admission. The boy who confesses going with bad companions, 
cheating, theft, or masturbation should encounter in the inter- 
viewer no critical or reproachful attitude, but a quiet acceptance 
of these admissions as so much evidence The interviewer should 
evince no surprise or discouiagement when his client admits di- 
vorce or an illegitimate child 

The emotional reactions of the client — anger, weeping, sullen- 
ness, shouting, berating, and the like — must be ignored so far 
as possible by the interviewer Sometimes these are reactions to 
the situations under discussion, at other times they are simulated 
attempts to influence the interviewer At all events the interviewer 
should not be swayed by them. He should *not talk back angnly, 
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nor show undue sympathy to weeping, nor should he coax or 
threaten 

Evasions of the interviewee, however slight, must be noted 
Topics vhich aic evaded should not be pressed, at least by a 
frontal attack, but sometimes it is possible to come around to 
them easily through collateral topics At other times it is wise 
to postpone their discussion until a later date when othei facts 
have been uncovered which make the point at issue more easily 
approached 

Still other devices for breaking resistance come under the head- 
ings “jollying,” “flattery,” “humor,” and the like It seems too 
bad that such cheap devices as these should have to be consid- 
ered in listing techniques for dealing with other people But 
human beings have their susceptibilities and their failings, and 
these repiesent avenues of approach that are effective Our philos- 
ophy often must admit that the end justifies the means, par- 
ticularly when the end is a worthy one and the means are not 
in themselves haimful 

Bogardus (7) has listed several mechanisms of mental release. 
First he mentions the naive type of release, exhibited in persons 
who are only too glad of an opportunity to converse freely and 
willingly, to talk shop, to relate an incident or to make a con- 
fession The second sort of release mechanism is called egotistical, 
because an appeal to vanity or pride will unlock the doors of 
reticence A thud mechanism is the confession, which by releasing 
pent-up tensions gives relief A fourth mechanism of release is 
termed scientific, to describe the attitude of the person whose 
deposition is “foi the sake of truth and science,” who takes an 
objective viewpoint concerning himself, who scorns much of the 
conventional code of judgments of “right” and “wrong,” and who 
IS even willing to “sacrifice” himself “for the sake of science ” 
The sophisticated type of mechanism is shown by those whose 
experience makes them inaccessible to the ordinary appeals — 
toughened individuals such as latvyers and physicians who know 
too much to “spill” It to the inquisitor ingenuously. 

Breaking down defense mechanisms Where there are definite 
mechanisms at work which cause the subject to defend a po- 
sition more or less stubbornly, rcsoit to extreme measures capable 
of stirring powxrful motives may be necessary to dislodge him 
from an intrenched position Some of these measures are taken 
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from salesmanship, others from legal practice Although, in gen- 
eral, measures which approximate cocicion or foicc should not 
be employed, since the mteivicw should be a coopeiatne enter- 
prise, they are sometimes necessary The first rule is to avoid so 
far as possible putting another person on the defensive If the 
interview is piopcriy opened and lapport obtained, there will 
seldom be provocation for a person to adopt a defense mechanism. 

One useful bit of strategy is to describe the ultimate outcome 
of things as they aie going To a pupil m school one may hold 
up the picture of failure at the end of the year, expulsion from 
school, failure to get into college, as being the outcome of 
his present behavioi Though this may to some extent involve 
fear, the endeator should be to have the subject project himself 
by anticipation into the ftituic without emotion It is most im- 
portant that he see his present situation in the light of its eventual 
consequences. 

A second device is “abusing for defense.” It is a well-known 
fact that most persons cannot bear to hear criticized or berated 
certain persons or things which they themselves uill at times 
criticize or abuse or perhaps neglect If the interviewer syste- 
matically proceeds to criticize a pupil’s parents, or his teacher 
or school, in all probability the pupil will rush to their defense, 
forced to find commendable featuies that no other technique could 
get him to lecognize or admit This is a dangeious device for the 
interviewer to play with, but is often most effective 

Other related devices are listed by Salsberry (57) under the 
headings “puncture,” “rushing,” “snayiiig by oratoiv,” “taking 
client off his guaid,” “using acquiied information,” “putting 
cards on the table,” “closing into a corner,” “instilling fear,” 
“negation,” etc. Certain of these oveilap, others aie names of 
devices to be used only in unusual situations Suppose a subject’s 
story has all the earmarks of plausibility but contains one point 
markedly inconsistent with other facts By decisively singling 
this out the subject’s whole story can be “punctured ” Harsh 
measures for putting the subj'ect in a corner, instilling fear, taking 
him off his guard, and the like should be avoided when possible 

Keeping the interview to the main issue In the interview an- 
other set of techniques is necessary to ensure keeping to the 
main issue At first the subject may be allowed to ramble until 
he has exhausted the high pressure of his pent-up emotions 
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through the safety-valve of free expression Sometimes as he 
does this a canny subject may set out giadually to veer away 
from the issue by planned digressions In any case the time 
comes in every interview when the main issue should no longer 
be dodged 

One of the simplest and most direct methods of keeping the 
interview to the topic is by the use of questions When the nar- 
rative IS finished, the interviewer proceeds to fill in the gaps and 
uncover the obscurities by direct questions If the mam issue is 
still sensitive, then the probing may be so indirect and along 
collateral lines that it gradually breaks through the outer de- 
fenses by disclosure after disclosure until the resistance to the 
main issue is weakened and finally overcome 

Salsberry (57) mentions other devices, some of which are mere 
repetitions of techniques used for other purposes Failure to 
answer digressive questions put by the subject may help to main- 
tain the line of advance. It takes two to keep a conversation 
alive, and the interviewer can cause some topics to expire by 
merely failing to respond to them. “Sharing personal experiences 
of the same nature as the mam issue” is a device which helps to 
keep the thoughts directed without coeicion to the point of chief 
interest 

Questioning The technique of asking questions is the master 
key to the art of interviewing Although the subject is given 
free rein at the start to tell his own story, the interviewer must 
eventually complete the interview by carefully selected and well- 
framed questions Questions may be asked to obtain new and 
unexpected information which the narrative did not yield, to 
obtain statements either corroboratory or contradictory as a check 
on the original narrative, to ascertain certain names, dates, loca- 
tions, etc , which were not included m the original statement, 
and to obtain a more complete picture of the informant’s asso- 
ciations, biases, outlook, prejudices, motives, and interests 

Meltzer (39), m his study of social concepts, asked questions 
to eke out the generalizations offered with more detailed state- 
ments If a pupil referred to democracy as “government by the 
people,” then further questioning might reveal that he meant a 
form of government, a form of rule, a form of group activity, or 
whatnot. 

The degree of difficulty of questions used is obviously extremely 
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important. Vocabulary and sentence structure must be within 
the grasp of the peison who is to answer. One of the major 
causes for misinterpretation of a question is that inquirer and 
respondent do not have in mind the same definitions of its 
terms. Before we suspect a person of untruthfulness, we should 
first make sure that his understanding of the questions coincides 
with our own. 

Of all questions the suggestive or leading question is most 
liable to produce erroneous answers In fact, suggestion is the 
greatest obstacle that the interviewer must surmount in obtaining 
facts. It Is extremely easy so to word the question, or give it an 
intonation by speech or gesture as to suggest the answer which 
the interviewer expects or would like Well-substantiated work on 
the psychology of testimony shows that the leading or suggestive 
question decreases accuracy of response People are open to sug- 
gestion and easily adopt the answers that the form or manner 
of the question makes most easy Because children are particu- 
larly prone to yield to leading questions, this type of question 
should be particularly avoided with them There are occasions, 
perhaps, when the suggestive question may be used If the inter- 
viewer has reason to believe that the subject is falsifying his 
testimony, he may try by the skilful use of suggestion to make 
him contradict himself and thus expose the falsehood This 
method is employed extensively in legal practice, where evasion 
is common. 

Choice of questions. In the choice of questions lies the core of 
interviewing — its soundness and effectiveness or its falsity and 
weakness It might seem as though it would be a comparatively 
easy matter to ask questions to help gain insight into a problem 
in conduct, but in practice to choose between relevant and ir- 
relevant questions taxes one’s utmost knowledge and skill The 
information sought must be pertinent to the factors and causes 
that have been potent in guiding affairs into the present impasse 
Knowledge is needed not only of general correlations between 
social phenomena but of the forces acting in the particular case 

The difficulty of selecting pertinent questions in the interview 
is most apparent in employment interviewing The candidate for 
a position as teacher, for instance, comes in to be interviewed 
by a superintendent of schools. A whole range of questions could 
be asked concerning training, experience, knowledge of teaching 
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skills, religious and political affiliations, hobbies, and interests, no 
one of which is known to have any substantial correlation with 
teaching ability A teacher may be deficient m one or more of 
these factors and still be a good teacher, or may measure up 
to each of these requirements and fail as a teacher. Knowledge 
of professional procedures may be counterbalanced by a lack of 
understanding of or sympathy with children The field of em- 
ployment psychology has developed a number of selective devices 
to be used in lieu of anything better which have frequently been 
governed by mere absurd hunches and prejudices such as an- 
tipathy to blondes, or to members of a certain race, or to those 
with peculiarities of speech As a result, choices of employees have 
been no more successful than chance It seems a pity to have 
to admit that teachers selected on the basis of attractiveness of 
personality have proved practically as satisfactory as though they 
had been chpsen on what would seem to be a sounder basis, pro- 
vided of course that they possess a minimum of intelligence and 
training * 

This discouraging admission does not mean that significant 
questions cannot be framed nor that the interview method is 
worthless for all purposes It does mean that the choice of ques- 
tions IS not the obvious matter it seems Questions must be asked 
after detailed study of their significance. Furthermore, the inter- 
viewer must himself have had a long and successful clinical 
experience, and be alert to supplement the interview by objective 
techniques demonstrated to be reliable and valid 

The selection of questions by systematic study is described in 
the chapter on questionnaires Though this scientific study of the 
significance of questions seems to point toward the use of pre- 
pared question schedules and the standardization of the inter- 
view, generalization is dangerous, and the method used should be 
determined by the purpose of the interview In clinical practice 
It is customary to diaw questions from a rather lengthy schedule 
with the hope of uncovering significant areas where the trouble 
may have originated and then to follow up these clues with more 
pointed questioning 

An interesting discussion has appeared in the literature of social 
work concerning the significance and value of the non-verbal ele- 

*Tiegs, E W, An Evaluation of Some Techniques of Teacher Selection 
(Public School Publishing Company, 1928) 
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ments in an interview A person expresses himself in many other 
ways than by words. The intonation of his voice, facial expres- 
sion, flushing, bodily attitudes and bearing, and gestures are sig- 
nificant because these responses indicate emotional tensions, com- 
plexes, and inhibitions The interviewer should be aware of these 
subtle responses, and quick to note the place where they occur in 
the development of the story Often it is possible to learn more 
by inference from these gestures than from direct verbal ex- 
pression Queen (51), who has biought to the attention of so- 
ciologists the impoitance of these subtle factors of expression, 
remarks that no single gesture may have much significance by 
itself, but all may have meaning when taken together The inter- 
viewer should also pay particular attention to their concomi- 
tants Evasions, confessions, stuttering, shrugs of the shoulder, 
sharp glances, wandering of attention, sudden shifts of the topic, 
rubbing the hands together, and shifting the position in the 
chair may not be irrelevant byplay but genuine modes of re- 
sponse to the topic under consideration On the other hand, of 
course, these responses may indicate lack of adaptation to the 
interviewing situation rather than to the topics being discussed 

In the training of social woikers for interviewing, the sugges- 
tion has been made that students be given definite practice in 
observing and recording these non-verbal factors. Those with long 
experience in interviewing, however, dissent from this proposal. 
They believe that attention to these modes of response in the 
end defeats its purpose by withdrawing the interviewer’s atten- 
tion from the personal and social problem encountered, and pre- 
vents full consideration of motives, mental mechanisms, and re- 
lationships between different parts of the story The non-verbal 
factors in the interview should be obseived and noted, but atten- 
tion to them should not usurp the place of attention to the more 
vital issues in the problem itself As in any skill, attention to 
the process may distract one from its end or product The process 
should properly be automatic, with attention free for fixation on 
the goal. 

Climax and denouement. The time will come in an interview 
when one feels that “the secret is out,” the confession has been 
made, and the irritating sources of conflict have been uncovered. 
Or, on the other hand, one may have used every appeal and 
touched on every possible source of conflict without being able 
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to break down inhibition or to establish confidence. In the first 
case one must not be hasty in jumping at conclusions The “se- 
cret” may be the wrong secret, or it may not be the whole seciet 
The successful interviewer does not depend on revelations, 
hunches, and coincidences His method exemplifies the thorough- 
ness, system, and caution which results in tentative conclusions 
In some cases it may be desirable and necessary to come to a 
conclusion immediately The physician must not only diagnose 
but prescribe. In case of doubt, however, the best results come 
from postponing the decision until a fresh and impartial survey 
can be made when all the possibilities have been considered 

Rademacher emphasizes that not too much can be expected 
from a single interview. One should not be at all discouraged if 
the interview leads to no distinct conclusion Sampling again is 
a factor here. A client has been seen m only one situation during 
a single interview New forces may be at work at the next sit- 
ting It IS a queer thing that inhibitions which persist with 
tenacity throughout a single interview require only the lapse of 
an interval for relaxation to occur As soon as the subject is away, 
he thinks of what he might have said and is thus prepared to 
meet the interviewer more than half way, perhaps, at the second 
meeting. 

Conclusion. The conclusion usually consists of mapping out a 
plan of action, but since we are concerned only with the diag- 
nostic phase of the interview this will not be discussed here. One 
should remember, however, to terminate the interview so that 
relations may be resumed later if desired Interviewer and in- 
terviewee should part as friends. “I am glad to have had this 
opportunity of talking things over with you” or “Let me know if I 
can ever be of help to you” or “Now that we are friends, come 
in to see me often,” together with a hearty handclasp, helps to 
terminate the interview favorably. 

Whatever the purpose of tire interview, to be of most value 
it should probably be followed up by others Healy (30) sug- 
gests several short interviews so as to avoid fatigue or boredom 
and so that the interviewer may observe the subject on several 
different occasions when variations in mood can be noticed It is 
well known that a first interview is often unsatisfactory and 
that resistance broken in the initial interview may yield good 
results later. 
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Accuracy of Interviewing 

General factors related to accuracy of testimony. Several 
decades ago there was a considerable wave of interest and research 
in the accuracy of observation and testimony Summary reviews 
of their earlier work may be found in Whipple’s reports (67, 68, 
69, 70, 7ij 72} 73} 74) on the “Psychology of Testimony and 
Report” in the Psychological Bulletin Some of the pertinent con- 
clusions reached at that time as they pertain to interviewing will 
be briefly sketched here. 

Interrogation leads to more error both in range and accuracy 
of report than does simple narration. Eye-witness evidence is 
more reliable than hearsay evidence Repeating a report tends 
to draw out memories of the original report rather than of the 
experience itself, so that errors in the original report are per- 
petuated Any discussion of experiences with others tends to warp 
and distort them The most common errors of memory are errors 
of omission, rather than errors of accuracy. In general there is 
less error on things which are said to be very certain The time 
interval between an experience and its report increases the in- 
accuracy of the report The accuracy of testimony can be im- 
proved by practice. Secondary qualities such as colors and num- 
bers can be reported less accurately than primary factors For 
this reason the description of an individual’s appearance is usu- 
ally worthless While excitement tends to improve observation and 
memory up to a certain point, beyond that point it impairs them. 
Excitement tends to cause misunderstanding and error m giving 
testimony. Men are more accurate but give less extended reports 
than women Children are less accurate than adults. The intel- 
ligent are more accurate than the less intelligent. 

All of these statements show tendencies or correlations rather 
than unvarying principles Most of them were determined before 
our present accurate methods of measurement and expression in 
terms of correlation were available, so that an elaborated repe- 
tition of the work would be desirable. Two primary sources of 
error hardly mentioned in this earlier work are (a) a tendency 
to conceal on the part of the informant and (b) errors of in- 
terpretation on the part of the interviewer 

As a general rule the subject will not say anything to incrimi- 
nate himself. If he is seeking a position, Jie will not tell of his 
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past failures or discharges The counselor should face this fact 
definitely and omit possibly embarrassing questions concerning 
previous service, for only favorable evidence will be revealed 
Pupils in school can hardly be expected to tell about their own 
cheating, failure to study, or bad companions, nor will they admit 
that they are shunned by their classmates or that their parents 
are cruel People dislike to report evidence that will get them 
into any fuss Most persons hesitate to report speakeasies, im- 
morality, traffic violations, and accidents for fear of getting mixed 
up in the matter The workman will not report dissatisfaction 
with his previous job for fear it will make him less acceptable 
in his new undertaking 

Again it is only too often true that the person being questioned 
will attempt to give answers that he thinks will please or satisfy 
the interviewer He studies the interviewer’s biases and point of 
view and tries to make his answers harmonize with them so 
that he may earn the interviewer’s esteem It is not merely a 
jest that students attempt to give answeis in oral examinations 
which they think will please the instructor rather than portray 
the facts -as they believe them to be Since the person being in- 
terviewed IS concerned about himself and governs his story ac- 
cordingly, the interviewer must make him believe that he is serv- 
ing his own interest best by telling the truth. 

Finally interviews are liable to error because the interviewer 
must interpret the subject’s story In drawing conclusions from 
the evidence gained in the interview, the interviewer should be 
extremely cautious and free from preconceptions Perhaps the 
one most potent cause of erior in the interview is that most in- 
terviewers start out with a handful of hypotheses uppermost in 
their minds and by which they are led to select evidence to fit 
in with then preconceived views. A case of neurosis is variously 
judged to be merely a set of bad habits, a condition of the cerebral 
hemispheres, a sex phantasy, or an infantile regression by so 
many different psychiatrists who sponsor these particular hy- 
potheses The interviewer must be extremely careful not only in 
what questions he asks but in what answers he rejects as trivial 
or unimportant. Every possible hypothesis should be considered, 
and rejected only when the evidence does not give it support 

Clark (21) points out that the reticent student often tantalizes 
the interviewer by suggesting clues which he refuses to elucidate. 
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Thus a frequent temptation is for the interviewer to elaborate 
hypotheses out of insufficient evidence, h)’'potheses which so soon 
gather a halo of plausibility that they become confused with 
actual evidence and are mistaken for it Once hypotheses and 
evidence become confused in seeking a solution to a problem, it 
may happen that no distinction between them is again perceived. 
Every diagnosis is a hypothesis, to be sure, but in reaching some 
diagnoses the mind travels too far ahead of the data at hand. 

Many interviewers become prejudiced on the flimsiest of ex- 
cuses. The vocational counselor rejects a candidate who has a 
nose like that of a disliked classmate or accepts one whose voice 
reminds him of an old nurse Or he gives an important interpre- 
tation to a certain hesitancy, an inability to look a person in the 
eye, a slight cough, or a tendency to volubility — each insignificant 
in Itself but unconsciously made the basis for a decision 

Again the interviewer must judge the degree to which his ques- 
tions are understood and comprehended by the subject and must 
interpret the answeis in this light Often answers arc considered 
to be evasions merely because the subject did not fully compre- 
hend the question. 

In general an interviewer fails to give either enough credit or 
enough blame to those deserving it The tendency in interviewing 
is to make judgments reach a rather dead level Claik (zi) found 
that interviews with college students made possible prediction 
of their grades with a correlation of 66 and .73. But the vari- 
ability of the interviewer’s ratings was not as great as the vari- 
ability of the actual marks Good students were recognized as 
good, but were not judged as bright as they piovcd to be Poor 
students were rated as poor, but the extent of their failure was 
not recognized 

Statistical evidence of the reliability of interview's is almost 
non-existent. Intel viewing has not been subjected to experimental 
scrutiny or statistical validation Interviewing has resisted stand- 
ardization, and many who find value in interviewing believe that 
to reduce it to a form of measurement is to reduce its usefulness. 
It IS difficult to impress some enthusiasts with the unreliability 
of testimony. Such people, and they are the majority, prefer to 
rely on their own biased, cramped, inaccurate observation and 
judgment rather than to resort to any more deliberate, systematic, 
and objective methods of collecting data. The convenience of the 
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interview can be no substitute for accuracy and dependability 
of results Tntci view mg must be subjected to the same standaids 
of reliability that other methods of collecting data are now sub- 
jected to 

The correlations found by Clark (21) between intervieweis’ 
estimates of students’ ability and their actual marks ( 66 and 
-(-73) indicate that intervievi mg possesses some value An in- 
teresting experiment has been reported by Baines and Pressey 
on the value of oral examinations conducted in a college class. 
The class was split up into committees, and each of six candi- 
dates was examined in turn by the committees This experiment 
showed an average correlation of 30 between the ratings of dif- 
ferent committees. The average correlation in this instance with 
class marks was 47. 

Industrial psychologists have demonstrated again and again 
the impossibility of selecting employees by interviewing, of what- 
ever kind The following experimental results taken from Hol- 
lingworth (31) are typical. 

Fifty-seven applicants for salesmanship positions presented 
themselves for examination by a variety of methods, 

“Twelve sales managers agreed to interview the applicants in- 
dividually and to rate them for their suitability for the position 
in question The interview procedure was not prescribed but was 
left to the dictates of the interviewer Each judge was required 
finally to rate all 57 candidates according to suitability, and these 
ratings were then transcribed into rank from I to 57 The fol- 
lowing table shows typical results ” 

Table 84 

Ranks Assigned Applicants by Interviewers 
(from Hollmgworth, 31, p 1 16) 

SALES UANACERS 


Apphcaia 

I 

a 

J 

4 

5 

6 

7 

5 

9 

la 

JI 

12 

A 

33 

46 

6 

S6 

26 

32 

12 

38 

23 

22 

22 

9 

B 

36 

5° 

43 

17 

51 

47 

38 

20 

38 

55 

39 

9 

G 

53 

10 

6 

21 

16 

9 

20 

2 

57 

28 

I 

26 

D 

44 

25 

13 

48 

7 

8 

43 

II 

17 

12 

20 

9 

E 

54 

41 

33 

19 

28 

48 

8 

10 

56 

8 

19 

26 

F 

18 

13 

13 

8 

II 

IS 

IS 

31 

32 

18 

25 

9 

G 

33 

2 

13 

16 

28 

46 

19 

32 

55 

4 

16 

9 

H 

13 

40 

6 

24 

51 

49 

10 

52 

54 

29 

21 

S3 

I 

2 

36 

6 

23 

11 

7 

23 

, '7 

6 

5 

6 

9 

J 

43 

II 

13 

II 

37 

40 

36 

46 

25 

IS 

29 

I 
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Applicant C is given fifty-seventh place by one sales manager 
and fiist place by anothei Applicant G is given fifty-fifth place 
by one and fourth place by another The entire lack of consistency 
shows the uttei impossibility of reaching agi cement on personal 
qualifications through the interview method Hollmgvv orth em- 
phasizes that these ratings were made by expert sales managers 
who were experienced interview'ers. 

Recording results. It seems to be rather generally agreed that 
note-taking during the interview is inadvisable. Note-taking is 
diverting to both the interviewer and the person interviewed. The 
subject’s confessions are likely to be constrained if he feels that 
what he says is being recorded An interviewer may be excused 
if he stops to record dates, names, and addresses wdiich would 
be likely to escape his memory However, the full report of the 
interview should be written down as soon after the interview as 
possible Even if the interviewer wishes to dictate to a stenog- 
rapher the report of a home visit on returning to his office, full 
notes should be taken immediately after the interview Important 
details may be forgotten in a few minutes unless jotted down 

Many writers urge that tlte interview be recorded with the 
exact language used So far as possible the report should be in 
the first person with exact quotations of the conversation The 
answer to a question should be preceded by the question itself. 

To be most helpful, the interviewei should attempt to include 
his reflections and impressions of the interview', being careful 
that these may be shaiply distinguished in the report from the 
actual words spoken The setting and background of the interview 
must be described The subject’s bcaiing, clothing, mannerisms, 
and approach should all be recorded The report should reflect 
the accents and emphasis of the pci son being interview'ed. 

Recently social workers have been experimenting with more 
elaborate forms for reporting interviews The “process interview” 
attempts to record not only the overt happenings and events tak- 
ing place during the interview, but also the result of probing un- 
derneath the surface to describe the psychological forces at play. 
Elaborate forms have been devised consisting of parallel columns 
in which may be placed not only the events and conversations 
in the interview but the motives and purposes which underlie 
each question put by the interviewer and each answer by the 
subject. Such a report attempts to show in high light the mechan- 
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isms that carry the interview forwaid T^is form of report is 
believed to be of aid m tiainmg mtervieweis by making them 
aware of then piocedure and helping them to take each step 
in the light of the psychological foices working and the ends to 
be gained. 

This “process interview” repoit may be criticized on several 
grounds. In the first place, n hatever values it may have for train- 
ing, it is a lengthy and difficult report to make and cannot be 
considered as practicable for clinical pui poses In the second place, 
from our know ledge of the difficulties _ of divining motives and 
mental mechanisms, it seems extremely doubtful that most of the 
interpretations can be more than guesses The piactical worker’s 
desire for insight and interpietation must constantly be checked 
by the psychologist’s demands for accuiacy and reliability The 
wish to be able to. divine undeilying motives in the interview 
should not overweigh the knowledge that such interpretations 
are to be considered in most cases as mere speculations 

The literature of social woik also refers to the “diagnostic 
summary” as an integral part of the repoit This summary gives 
the mteiviewer’s interpretation of the interview. It emphasizes 
the focal points and organizes them in such a way as to present 
a connected analysis and picture of the situation Such an inter- 
pretation or “diagnostic survey” ought to be included in the re- 
port of every interview But this summary should be frankly 
recognized as a subjective interpretation — a hypothesis the value 
of which depends solely on the adequacy of the data on which 
it IS based and the interviewer’s sagacity in drawing conclusions 
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Chapter XIII 

PSYCHOANALYSIS 


I N psychoanalysis we have reached the frontier techniques for 
the diagnosis of conduct In this region much is uncharted, 
much unknown Psychoanalysis is the antithesis of standard- 
ization The method used is free association — ^peifectly fice, with- 
out even a list of stimulus woids to leact to The physician in 
making his analysis must depend upon all the finesse of judgment, 
sagacity, and interpretative ability which he can command Psy- 
choanalysis IS an ait, not a science In psychoanalysis one cannot 
give a test, scoie the results, and turn to a manual foi its inter- 
pretation If psychoanalysis succeeds, it is thiough insight derived 
from the careful study and wide cxpeiience of the analyst, if it 
fails. It is because the physician cannot measure up to the demands 
of mterpietation and insight placed upon him. 

If one becomes confused in tiying to understand psycho- 
analysis from the expositions of it in manuals and tieatisc’., the 
reason may he paitly m the fact that psychoanalysis has a three- 
fold nature — it is first of all a theoty of human behavior, second 
a technique of diagnosis, and third a theiapeutic measuie These 
three phases of psychoanalysis are not always cleaily distin- 
guished Most of those writing about psychoanalysis arc interested 
in contributing to its structuie as a thcoiy of human nature, while 
those who practise it wish to cure mental disease In effecting this 
cure they make inquiries into the intimate personal wishes and 
inhibitions of the patient and interpret the resulting revelations 
in the light of the psychoanalytic theories familiar to them If 
these interpretations aie accepted by the patient as being true, 
the cure is effected in this very act of acceptance, apparently by 
helping the patient to integrate reaction systems which cannot 
otherwise operate in harmony 

Psychoanalysis makes use of many of the techniques described 
in the chapter on interviewing Ceitain of the principles developed 
in the chapter on the free association method are also utilized. 

485 
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Psychoanalysis, therefore, is the most completely integrated 
method among all the methods considered in this book. Yet psy- 
choanalysis as a technique is distinctly private, personal, indi- 
vidual. 

Psychoanalytic Theory 

The technique of psychoanalysis can hardly be understood 
without some knowledge of psychoanalytic theory At the risk 
of condensing a summary of psychoanalytic theory to the point 
where it will become misleading, we shall undertake a brief 
theoretical survey before proceeding to a discussion of the tech- 
nique. Although there is more than one theory, the mam stream 
of psychoanalysis derives its conceptions from its founder, Freud 
Diverging from Freud’s are variations in theory sponsored by 
Jung and Adler, with minor variations elaboiated by Rank, 
Ferenczi, and others. 

One major difficulty in comprehending psychoanalytic theory 
is the vocabulary A weighty structure of technical terms has been 
erected which effectively obscures its import from the uninitiated 
Part of our task then is that of providing a glossary of psycho- 
analytic terminology 

We are asked to recognize first of all that the driving forces 
in the human diama are internal Any organism has certain cellu- 
lar needs and organic irritations and instabilities that constitute 
the driving forces to behavior Most of these irritations and in- 
stabilities require some sort of a readjustment with the outer 
environment in order to effect a reduction of the irritation or a 
state of greater stability Hunger and sex are the two most im- 
portant illustrations of these internal driving forces in man The 
driving force of sex, which is a real physiologic force, has been 
termed by Freud the libido. 

This libido or driving force demands contact or readjustment 
with the outside world for alleviation and satisfaction Hunger 
requires food in order to be satisfied, and the individual must go 
out and seek his food The demands for sexual gratification re- 
quire, m their unspoiled state, outer contacts for their satisfaction. 

The original means for satisfying these imperative organic de- 
sires m the infant occur long before the development of language 
or before experiences are apprehended by means of symbols or 
images. In other words, oJiey are unconscious. They are instinctive 
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if you like, or represent very rudimentary forms of learning, but 
they occur without any conscious apprehension on the part of the 
baby Psychoanalysts make much of the fact that our earliest re- 
actions are unconscious — the marvel is that some of our reactions 
are attended by awareness. The overwhelming number of our 
reactions and adjustments are unconscious, consciousness repre- 
senting a more superficial froth on the deeper currents of our 
daily adjustments Practically all of the tremendous amount of 
postural adjustment that takes place in boxing or playing a vigor- 
ous game of tennis pioceeds without conscious direction. 

Freud was one of the first to emphasize the fact that even the 
infant has sexual drives which require and eventuate in sexual 
adjustments Early manifestations of the sexual interest in the 
infant come fiom the satisfactions resulting from excitation of the 
so-called erogenous zones, seen first in the satisfactions of suck- 
ling and later sucking Equally potent sources of satisfaction by 
the stimulation of erogenous zones appear in defecation and 
urination. Still later in life a similar satisfaction may be aroused 
in connection with stimulation of the genitals. 

Freud is fond of pointing out that as sexual urges develop and 
find their own appropriate and unique means of gratification, these 
expressions tend to become fixed or fixated. Once a person has 
found a satisfying mode of sexual activity, it is practised until it 
becomes habitual, and there is no tendency to seek alternative 
objects of desire or alternate modes of satisfaction Other funda- 
mental drives, such as hunger, are satisfied only by adjusting 
to a changing environment, so that they become amenable to 
education But the sexual urges often derive their satisfaction 
within the body itself A baby may suck its own thumb Again the 
sexual desires are satisfied in the course of other activities, as 
when eating, disposing of bodily wastes, and the like In other 
words, they are self-contained, insulated, or parasitic and hence 
lemain more or less immune to educative processes. 

Another phenomenon emphasized by the Freudians is the taboo 
placed by society on all sexual activity except that leading to re- 
production in wedlock. As Freud (9, p 269) says: 

“One of the most important educational tasks which society 
must assume is the control, the restriction of the sexual instinct 
when It breaks forth as an impulse toward reproduction, it must 
be subdued to an individual will that is identical with the man- 
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dates of society . . . Experience must have shown educators that 
the task of guiding the sexual will of the new generation can be 
solved only by influencing the eaily sexual life of the child, the 
period preparatoiy to puberty, not by awaiting the storm of 
puberty. With this intention almost all infantile sex activities 
are foibidden to the child or made distasteful to him, the ideal 
goal has been to render the life of the child asexual.” 

Consequently we find, so far as the sex life goes, two compet- 
ing systems one, the fundamental sexual drives with their fixated 
modes of satisfaction, the other the scruples, taboos, restrictions, 
and repressions established by the societal moies toward all mat- 
ters sexual 

These competing systems, the one the responses in quest of 
satisfaction to the fundamental sexual urges, the other the teach- 
ing of society, lead in later life to bitter conflict and even become 
the root of neuroses. For we find that not only are the primary 
sexual satisfactions born in unconsciousness, but they are kept 
unconscious, are prevented from rising to consciousness, and are 
~epiessed as unsuitable for consciousness in being vulgar, lurid, 
and sinful Freud has created the fiction of a censor in order to 
dramatize this inhibitory power of the conscious life best repre- 
sented by those ideals of society which aim to prevent an indi- 
vidual from directing his attention to his sexual activities The 
individual puts them out of mind, just as in the World War the 
power of propaganda was so strong’ that charitable feelings toward 
the enemy were not even considered, to say nothing of being 
definitely entertained But the sexual life still persists in its 
modes of giatification which, because they are not accepted or 
recognized as such, appear in reactions grotesque, bizarre, and 
unexplainable A neurosis has developed whose cause the indi- 
vidual afflicted cannot even remotely surmise. 

The individual, however, must continue to live even though 
burdened with two reaction systems which are thus opposed to 
each other. The relation of an individual to objects or persons 
toward which he reacts sexually without the sanction of the con- 
scious verbal organization indicates the presence of what is known 
as a complex Complexes exhibit themselves in two characteristic 
ways, one by a strong charge of emotion, the other by a reticence 
or inhibition which causes the individual to repress the overt ex- 
pression of the sexual nature of the relation. Usually complexes 
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are thought of as responses to words or Ideas which have an 
emotional tinge. 

Certain of these complexes have been given special names 
The cedipus complex (so-called from the old Greek myth) con- 
sists of the tenderness of a son toward his mother, harmless 
enough on the surface but with far-ieaching consequences. A 
corresponding complex called the Elect! a complex relates to the 
tendeiness of a daughtei toward the father The castration com- 
plex IS the belief, early acquired by a child, that in some way 
his genital organs are inferior or impotent, a fear that they may 
become so, or the fear which results when reprimanded for hav- 
ing committed some sexual perversion such as masturbation 

At this point in our discussion of psychoanalytic theory we 
ought to introduce a number of case studies These theories of 
early sexual expciicnce, fixation, repiession, the complexes, and 
the like are so abstiact and even bizaire when described that one 
would never dream how the psychoanalyst can sec evidences of 
their existence in everyday affairs But their explanations pene- 
trate to the warp and woof of all common experience in ways 
that are uncanny Many of the interpretations of everyday human 
experiences made in the light of psychoanalytic theory would 
never occur to the novice unless specifically pointed out. 

The methods used in inhibiting sexual experiences, and in at- 
tempting to explain or justify sexual behavior so that it does not 
appear to have a sexual significance, have led to the discovery of 
a number of behavioral mechanisms. These may be mentioned 
briefly 

Suppression refers to the deliberate concealment or submerg- 
ence of a reaction tendency which is socially unacceptable Later 
this deliberate attempt to ignore or forget socially unacceptable 
reaction tendencies may become habitual, whereupon it Is called 
repression 

Rationalization is the attempt to justify an unacceptable mode 
of reaction by logic, argument, or reason. One tends to explain 
away an inconsistency in behavior by reasons that are trivial or 
beside the point The rationalizer is a person searching for props. 

When one tries to explain to a person the real cause of his in- 
consistency, his error, his forgetfulness, or his peculiaiity, the 
attempt is met by resistance. Not only is the true relationship 
of the experience repressed, but there is» resistance toward re- 



490 Diagnosing Personality and Conduct 

spending to it fully. We refuse to listen to or harbor the thought 
that would indicate that our reactions are not really in harmony 
with our piofessions. There is a resistance which keeps two re- 
action systems apart 

One of the interesting ways by which sexual reactions lose 
their original direction and acquire new objects of allegiance is 
known as transfeience. This is probably nothing more tlian a par- 
ticular instance of the laws of learning at work in the process of 
redintegration or conditioning The best example of transference 
is the lavishing of affection on some individual who resembles in 
some way another whom we alieady love and respond to sexually. 
But this transference may be negative as well as positive, and 
dislike or disgust can be transferred as well as affection. 

Regression is a term used to describe reversion to an old and 
successful mode of reaction in a new situation where resistance 
IS encountered. The youth entering business where competition 
is keen and discouraging maintains his collegiate interest in sport 
as the method of outlet which has yielded him the most recogni- 
tion in the past Sometimes the regression is back to childish or 
even infantile forms of behavior, so we are told by the Freudians, 
when the difficulty to be faced by an individual is so severe that 
no other alternative gives the requisite satisfaction 

Two o,ther mechanisms are known as identification and projec- 
tion. In identification one puts himself in place of another person 
and reacts as he imagines the other person would act or imagines 
himself acting as the other person (maybe a fictional character) 
does act. The small boy or girl often wull attempt to imitate father 
or mother, or will live through experiences as though he were 
a real robber or Indian, or she a mother or teacher. Indeed, 
identification in imagination is one way we gratify our vrishes for 
which direct overt gratification is impossible 

Projection, on the other hand, is the interpretation of the acts 
of others by attributing to them feelings or evaluations which one 
possesses himself but which he either fails to recognize or resists 
recognizing One may ascribe the snobbishness or unfriendliness 
to others which in truth characterizes better his own behavior. 
One may accuse others of being gossipy or critical when he ex- 
periences difficulty in inhibiting these very tendencies in himself. 
A pupil in school asserts that his teacher gave him his mark when 
in fact he determined it himself by his own achievement. 
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Another mechanism is sublimation, the draining-ofF of some 
organic desue, such as sex, into channels of activity that receive 
social recognition. In adolescence, particularly, when the ripening 
reproductive functions add their urge to the other sexual drives 
that have persisted through childhood, some sort of sublimation 
is necessary. Society puts a ban on direct gratification e.xcept 
through the recognized foim of wedlock In the meantime youth 
must seek satisfaction thiough sport, dress, art, religion, parties 
and dances, and other activities which may temporarily act as 
substitute satisfactions for the more direct physical gratification 
Wheie an attempt to gratify a drive is abandoned altogether and 
attention is directed toward some other source of satisfaction, the 
mechanism is called compensation The girl who cannot gam at- 
tention through good looks or attracts e clothes may try to gain 
recognition as a good student The boy who cannot achieve dis- 
tinction in a fraternity may go in for sport or making money. 
In many ways one may attempt to compensate for a real or 
imagined inferiority by substitute activities. 

Then again defense and escape mechanisms are referred to, 
methods of escaping from reality by withdrawing into one’s own 
self in imagination or day-dreaming or methods of defending one- 
self against some inferiority by passing as a tough, hard-boiled, 
dangerous youngster. 

The Freudian psychoanalytical theory has received severe criti- 
cism from orthodox psychologists It is true that since as a theory 
It has not received scientific verification by experimental proof, 
it must still be considered a hypothesis But as a theory it is ex- 
tremely lusty, and there are abundant indications that it should 
be considered as a promising hypothesis even in scientific circles, 
worthy of being submitted to all of the tests that vigorous thought 
can propose As we shall see when we discuss psychoanalytic tech- 
nique, the very nature of the theory, bound up as it is with our 
repressions and resistances, makes it a difficult one to test by 
the usual experimental approaches. 

The Freudian emphasis on sex as the ultimate source of prac- 
tically all mental conflicts is the one feature of psychoanalysis 
which is most criticized by orthodox psychologists. Irrefutable 
evidence justifying the criticism seems to have been obtained 
by Rivers, whose experience with functional neuroses de- 
veloped on the battle-field dqring the World War led him to 
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believe that fear, not sex, was the driving force involved in these 
cases. 

After all, repression rather than sex is the corner-stone of psy- 
choanalytic theoiy. Freud’s contribution to our knowledge of 
human behavior lies in showing how the first and natural modes 
of expression for normal human needs and drives may become 
submerged beneath a veneer of custom and respectability which 
alone is socially acceptable So far as his theory goes, any per- 
sistent drive that becomes repressed may lead to a neurosis In 
the World War, normal fear reactions which had been stamped 
m by years of experience were socially unacceptable When the 
soldier did not dare to exhibit fear on the battle-field with the 
whole country behind looking on, regressions and other mechan- 
isms of severe neurosis developed in consequence In everyday 
life such repressions without any sexual reference are going on 
continually When the child who peisists in asking his mother for 
some toy for the possession of which he has envied a playmate, is 
answered, “Don’t mention that again,” he is repressed exactly as 
IS the child who is shamed from continuing some sexual gratifica- 
tion 

Probably F reud is on the whole not far wrong in claiming that 
in all of his analyses the clues led eventually to a sexual complex. 
In normal society no other drive is repressed in a wholesale way, 
even from thought and conversation, as is sex Few other drives 
have quite the same biological persistence and endurance No 
other drive is so limited in its modes of gratification That sex is 
to be found the basis of the overwhelming majority of neuroses 
is not to be wondered at when one considers how completely so- 
ciety for Its own protection keeps sex submerged below the 
thought and action current of daylight affairs But the same 
mechanisms of repression and the same mechanics of substitution 
are to be recognized in many of the affairs of everyday life. 

A rival school of psychological theory — ^“individual psychology” 
— has been developed by Alfred Adler, originally a disciple' of 
Freud, but long since apostate Adler was one who could not find 
a sexual source for every neurosis He developed the theory of 
compensation for “organic” inferiority, but later broadened his 
theory to include all kinds of feelings of inferiority or insecurity, 
both real and imaginary, as a basis for nervous disorders. A 
priori, It is not so easy to envisage the source of energy for a 
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“feeling of inferiority” which could result in abnormal behavior, 
as it IS for sex with its purely physiological basis. This necessity 
for social appreciation, however, is a drive which becomes condi- 
tioned very early in life. 

“Each of us, apparently, has built up an intricate series of 
responses to the way in which other people flatter, ignore, or dis- 
parage our ego or personality These responses have their roots 
in infancy Our first acts are directed toward alleviating such re- 
current demands as hunger, thirst, sex, drowsiness, and cold 
Some pel son, usually the mothei or father, is always concerned 
with the production of the original satisfaction, so that we learn 
early to respond to other people Habits are built up of ciaving 
their presence and attention, and of dominating or being domi- 
nated by other people in order to achieve what is desired And 
parents, meanwhile, are employing a variety of forms of expres- 
sion such as smiling, scowling, and later flattery and scolding, as 
they provide or withhold these satisfactions, these forms of expres- 
sion tend to become associated with the acquirement of the satis- 
factions The net result is that these expressions and actions on the 
part of other people tend to become associated with our satisfac- 
tions and deprivations By the time we become adults these 
stimulus-response situations have become so generalized that the 
lesponses which were originally very definite expressions of satis- 
faction and annoyance at obtaining or not obtaining what we want 
have melted into a vague exaltation or depression of the ego or 
personality ” (20) 

One might say, then, that Adler’s theory does not go back to the 
beginning, and Fieud claims that when traced back ultimately all 
neuroses have a sexual origin Wc can grant that this controversy 
IS often trivial, at least to some extent Ccitainly some of the psy- 
choanalytic literature produces explanations that are so far-fetched 
and bizarre as to be palpably ridiculous Every case of neurosis 
presents a picture of maladjustment when tiaced far enough, and 
the clinical diagnosis should be considered adequate when the 
source of the maladjustment and the reason for the repression or 
inadequacy is disclosed Usually it is not necessary to seek far 
for the source. 

Technique 

The techniques for conducting a psychoanalysis are many and 
diverse Each practitioner has developed his own variation Since 
most analysts have medical training, the sybject is received as a 
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patient. The analyst is alert to detect the working of subtle mental 
mechanisms since his endeavor is to Interpret and control every 
contact with his patient One writer recommends that patients be 
dismissed by an exit which does not lead through the waiting 
room in which patients are received, so that there may be no con- 
tagion of impressions from those who are leaving. The welcome 
given to the patient is Important, since the relation of the analyst 
to the patient is a very important part of the technique The ana- 
lyst is urged by some to shake hands and exhibit warmth and 
heartiness so as to establish confidence from the start Others rec- 
ommend that the analyst be pleasant but so far as possible neutral 
in his reception so that he will not be the stimulus to any trans- 
ferred emotional responses in the patient 

Most analysts spend a session or two taking a case history of 
the patient to include dates of important events and crises, names 
of relatives and friends, and other data which serve as a means 
of introduction and general orientation The question of fees must 
also be settled in these opening sessions Patients who recciv e ana- 
lysis gratuitously have not proven satisfactory patients, are 
unpunctual or irregular, lose interest readily, and tend to terminate 
the treatments abruptly, so that as a rule payments are recom- 
mended, however small An analysis must extend over a long 
period of time — several months at least The necessity for this 
should be discussed frankly with the patient at the outset and 
daily consultation periods should be planned at the outset to 
extend indefinitely into the future. To these engagements the 
analyst must be strictly punctual 

The routine of the session should be studiously preserved 
Although the analyst must try to gam the subject’s confidence, the 
session should be business-like and professional The fewer the 
intimacies (i e , outside of the content of the analysis) that de- 
velop between analyst and subject, the more successful the analy- 
sis. Note-taking is generally condemned, although perhaps a few 
random notes to help keep names or events in mind are per- 
missible. 

Freud says that he adheres firmly to the plan of requiring the 
patient to recline upon a sofa, while he sits behind him out of 
sight. Reclining encourages relaxation and facilitates the breaking- 
down of tensions and inhibitions. To have the analyst out of sight 
removes still another disturbing stimulus for the subject. It is 
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also important that the subject should not perceive the impres- 
sions that his revelations make on the analyst, and the easiest way 
to prevent this is for the analyst to keep out of sight 

The analysis is essentially an oral free-association on the part 
of the subject Such a simple and trivial method would seem par- 
ticularly sterile were it not for the tendency of the associations 
continually to veer around in the diiecUon of significant and dis- 
tressing complexes Freud (8, Vol II, p 355) says of commenc- 
ing the analysis, “What subject-matter the treatment begins with 
is on the whole immaterial, whether with the patient’s life-story, 
with a history of the illness or with recollections of childhood, 
but in any case the patient must be left to talk, and the choice of 
subject left to him ” 

One fundamental rule of psychoanalysis can best be described 
in Freud’s own words (8, Vol. II, p 355) . 

“One thing more, before you begin Youi talk with me must 
differ in one respect from an ordinary conversation Whereas 
usually you rightly try to keep the threads of your story together 
and to exclude all intruding associations and side-issues, so as not 
to wander too far from the point, here you must proceed differ- 
ently You will notice that as you relate things various ideas will 
occur to you which you feel inclined to put aside with certain 
criticisms and objections You will be tempted to say to yourself: 
‘This or that has no connection here, or it is quite unimportant, 
or It IS nonsensical, so it cannot be necessary to mention it.’ 
Never give in to these objections, but mention it even if you feel 
a disinclination against it, or indeed just because of this Later on 
you will perceive and learn to undcistand the reason for this in- 
junction, which IS really the only one that you have to follow. 
So say whatever goes through your mind Act as if you were 
sitting at the window of a railway tram and describing to some 
one*behind you the changing views you see outside Finally, never 
forget that you have piomised absolute honesty, and never leave 
anything unsaid because for any reason it is unpleasant to say 
it.” 


With the train of free association started, the game begins — a 
subtle contest between the analyst, unearthing secret and hidden 
complexes, and the subject, more or less unconsciously trying to 
avoid revealing them. There are many tricks which the subject 
will employ, such as spending a long time in discursive associa- 
tions of no particular import, talking superficially or about im- 
mediate current affairs. Another subterfuge employed by the 
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subject is the preparation each day of his story whereby material 
is carefully selected ahead of time and that which he resists dis- 
closing IS kept secret. 

Through all of these incoherent ramblings the watchful an- 
alyst is able to discern focal points which indicate complexes 
These may be discovered by watching for the “complex indicators” 
which were described in the chapter on free association Particu- 
larly important are contradictions, gaps in memory, supplemen- 
tary material inserted when repeating the description of an inci- 
dent, etc. As the associations lag, the analyst may ask for the asso- 
ciations that are called up by one or another word, name, or 
phrase mentioned previously In the later work there may be a 
great deal of this winnowing and probing by calling for the asso- 
ciations aroused by particular settings Where these arouse emo- 
tions, pain, blocking of any kind, the probe must digress tempo- 
rarily but always working to uncover the repressed material For 
the mam task of psychoanalysis is to unearth by free association 
the submerged wishes and desires which are held repressed by 
the respectable, society-sensitive ego. 

One very important phase of analytic technique is the analysis 
of the subject’s dreams. In psychoanalytic theory dreams are held 
to be the expression of the basic but repressed desires which are 
distorted by various mechanisms of symbolism. A debated and 
criticized aspect of Freudian psychoanalysis is the codification of 
dream symbolism as though there is or tends to be a definite and 
necessary correspondence between a particular dream symbol or 
dream content, wherein it occurs, and a particular diagnostic sig- 
nificance or psychoanalytic connotation. The interpretation of 
dreams is especially complex because of the variety of mech- 
anisms by which dream material is distorted. Freud says. “In 
general it is doubtful in the interpretation of every element of 
the dream whether it (a) is to be regarded in a negative ot 
positive sense, (b) is to be interpreted historically, (c) is sym- 
bolic, (d) Its valuation is to be based upon the sound of its verbal 
expression ” 

A psychoanalytic diagnosis must depend entirely on the sage 
interpretations which the analyst can make in the light of psycho- 
analytical theory. The analyst is always sensitive to evidences of 
the existence of complexes, he watches for indirect expressions of 
sexual gratification, the objects or persons to which it is attached, 
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and the degree to which there is satisfaction or unsatisfied long- 
ings and desires The analyst watches particularly for the rela- 
tionship of the subject to familiar persons in the life story as it 
IS unfolded, to discover possible hidden sexual relationships which 
would never be recognized and acknowledged from everyday sur- 
face events even by the subject himself. The analyst is quick to 
explain otherwise colorless admissions or denials by one or another 
of the common “F reudian” mechanisms, thus tending to give them 
a hidden, secret, sexual significance. 

Other features of the analysis — ^its length, decision as to the 
proper time to disclose interpretations to the patient, or the im- 
portant transference to the physician of emotions of love or hate, 
cannot be discussed here, as they belong primarily to psychoan- 
alysis as a therapy and not as a diagnostic technique In fact 
much of the cautiousness with which an analysis proceeds is based 
on therapeutic considerations rather than on a regard for what 
IS essential to the diagnosis. Most psychoanalysts maintain that 
they are able to divine the essential features of the case long 
before it is possible to reveal them to the patient. 

Indeed Freud practically admits that if an understanding of the 
case by the physician is all that is wanted, psychoanalysis is an 
awkward method of attack He says (8, Vol. II, p 362): 

“In the early days of analytic technique it is true that we re- 
garded the matter intellectually and set a high value on the pa- 
tient’s knowledge of that which had been forgotten, so that we 
hardly made a distinction between our knowledge and his in 
these matters. We accounted it specially fortunate if it were pos- 
sible to obtain information of the forgotten traumas of childhood 
from external sources, from parents or nurses, for instance, or 
from the seducer himself, as occurred occasionally; and we has- 
tened to convey the information and proofs of its correctness to 
the patient, in the certain expectation of bringing the neurosis and 
the treatment to a rapid end by this means It was a bitter disap- 
pointment when the expected success was not forthcoming.” 

In other words, Freud admits that If the facts are what is 
wanted, one need not use the cumbersome psychoanalytic method 
but may search and gather them wherever most available. In 
practical application in diagnosing a child’s problems one need not 
question the child exclusively, but should gather evidence from 
teachers, parents, or friends, as most convenient. 
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Evaluation of Psychoanalytic Technique 

Psychoanalysis as a diagnostic technique is inaccessible to 
validation by the standard tests which are employed in determin- 
ing the leliability and validity of evidence Psychoanalysis, since 
it can never be other than the intimate confession of subject to 
analyst, is immune to the usual scientific checks which may be 
used. In consequence of this, our evaluations of it must be de- 
ductive, we must fall back upon our knowledge of the worth of 
related techniques. To most of the criticisms leveled against it 
psychoanalysis has prepared a peculiar kind of answer which is 
invulnerable to ordinary psychological logic 

Psychoanalytic diagnosis may be criticized in the first place by 
questioning the accuracy or objective reference of the patient’s 
free associations. Our knowledge of the psychology of testimony 
would lead us to expect that here as elsewhere there would be 
serious gaps or distortions of memory and observation Where 
there is emotion or a complex, the interpretation of events is par- 
ticularly apt to be in error Although it is well known that mem- 
ories lose their accuracy with the lapse of time, psychoanalysis 
makes much use of memories of childhood occurrences, with little 
regard for the probability of distortions The psychoanalyst ad- 
mits there are errors and lapses in observation, memory, and 
interpretation, but maintains that he recognizes them when they 
occur and properly attributes them to the mechanism which oc- 
casioned them In fact, practically nothing that the subject dis- 
closes is taken at its face value — all is interpreted in the light 
of psychoanalytic theory Freud, drawing on the results of the 
free association experiment, concludes that trains of free associa- 
tion follow certain laws and are governed by certain forces They 
may coincide with reality only rarely and in dreams may be fairly 
fantastic, and yet they are constantly infringing on reality Since 
the psychoanalyst readily admits that testimony is fact mixed with 
fiction, the sting of this, first criticism is removed. 

While considering the accuracy of the testimony one should dis- 
cuss another criticism that has been raised against psychoanalytic 
diagnosis To what extent does knowledge of psychoanalytic 
theory color and distort the subject’s free associations? If a sub- 
ject knows that psychoanalysis always looks for some ungratified 
sexual desire, and if he voluntarily undergoes analysis, does he 
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immediately play up to what he thinks is expected of him^ This 
IS a serious question Few people undergo psychoanalytic treat- 
ment without having some idea of what psychoanalysis is Does 
the psychoanalyst ever know when he is being humbugged by 
having his subject say the things which he thinks he is expected to 
say? 

If so much depends on the interpretation which the analyst 
makes of the free associations of his patient, does not our knowl- 
edge of the psychology of judgment give us good ground for 
entirely discrediting a psychoanalytic diagnosis^ If it is so diffi-, 
cult to rate the character of pupils, employees, and friends even 
after close observation, is it possible that the analyst can read the 
truth from the random mental wanderings of his subject by his 
own sheer interpretative abilities? 

Psychoanalysis has been severely arraigned for its methods of 
arriving at conclusions which would never for an instant be ad- 
mitted by science or in a court of law Tannenbaum in a clever 
aiticle gives a clear exposition of how Freud in interpreting one 
of the cases he describes, jumps at conclusions prematurely and 
with insufficient evidence He draws one conclusion where, by his 
own theories, other conclusions could fairly be drawn. Tannen- 
baum (21, p 62) concludes his “exposure” by stating, “It is the 
wilfulness, arbitrariness, and capriciousness of the Freudian tech- 
nique and inteipretations appearing in every instance of an ‘an- 
alysis’ submitted to critical tests that prove orthodox psycho- 
analysis to be nothing more than a cleverly conceived therapeutic 
system which has no more scientific validity than Chiistian 
Science ” 

If Freud with his genius and plausibility and ingenuousness is 
so vulnerable to attack, what of the less capable amateur “an- 
alysts” who tiy to employ his methods'* Here unsubstantiated as- 
sertions are crassly in evidence Take for instance the following 
statements found in Zachry’s Personality Adjustments of School 
Children (27, p 55) “It has already been suggested that Ned’s 
scowling and tantrums are his defense against his feelings of in- 
security and infeiiority” Or (27, p 115), “Esther made up for 
difficulties, or made an attempt to compensate for them, by sub- 
stituting school success for everything that she lacked Her failure 
to compensate was due, in part, to the fact that she had no inten- 
sive interest in her school work. It maji be that because her 
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thwarted desires and repressions are concerned with her love 
needs, they cannot be compensated on a purely ego level, the level 
of her school work ” Or consider such statements as are found in 
Rogers’ (i8) Measuring Personality Adjustments in Children 
Nine to Thirteen, “Long contact with a psychiatrist revealed the 
fact that these fears of being attacked seemed to take their root in 
a very early experience when he was attacked and severely bitten 
by a dog ... It seems possible that the early conditioning fear 
of the dog colored much of Walter’s life.” Or in another case' 
“The stealing and eneuresis were more or less deliberate weapons 
against the aunt, who was both unsympathetic and unwise ” The 
fashion set by Freud of interpreting all unusual or undesirable 
behavior in terms of adjustment mechanisms may finally result in 
pure speculation on the part of his followers. Even though pos- 
sessing a certain plausibility, such statements as the foregoing 
cannot be considered as inevitable conclusions from the evidence 
presented. 

Freud is apparently insensitive to these attacks, although he 
IS aware of them He possesses a boundless confidence in the ac- 
curacy of his interpretations For instance, with regard to dreams 
he says (9, p 195), “What might, for example, impress you as 
arbitrariness in the interpretation of symbols, is compensated for 
by the fact that as a rule the connection of the dream thoughts 
among themselves, the connection of the dream with the life of the 
dreamer, and the whole psychic situation in which the dream oc- 
curs, chooses just one of the possible interpretations advanced and 
rejects the others as useless for its purposes ” Freud never forces 
an interpretation The tram of free associations advanced must 
continue until some one interpretation becomes irresistible. Thai 
is, psychoanalytic diagnosis depends for its validation on the de- 
gree of consistency and harmony which its interpretation contains 
rather than its obedience to the laws of inference It is undeniable 
that psychoanalytic interpretations have a sort of enticing plausi- 
bility to even the superficial observer. 

Psychoanalysis as a diagnostic technique should be viewed with 
an attitude of favorable skepticism. It cannot be scientifically 
validated and can be severely criticized when studied in the light 
of the psychology of testimony, judgment, and inference On 
the other hand, its keen application of psychological mechanisms, 
coupled with a dynamic psychological theory of considerable vigor, 
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gives it a somewhat impressive plausibility. Psychoanalysis should 
be considered a promising hypothesis. If offeis unique challenge 
to the psychologist of scientific bent Perhaps the severest ar- 
raignment of the psychoanalytic method concerns its unconscion- 
able demand for unlimited time Its uneconomical procedure con- 
stitutes a challenge to the psychological engineer who, aware of 
the significance of the mechanisms it uncovers, can devise some 
means of breaking resistance more easily by methods that can be 
standardized and which can be employed with groups. Such de- 
vices are certain to arouse opposition at first because they probe 
into things ordinarily considered private and personal, and be- 
cause early attempts are .almost certain to be cumbersome. But 
this survey of the whole range of techniques for the diagnosis 
of conduct points to psychoanalysis as the unknown promised 
land. 
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Chapter XIV 

EXTERNAL SIGNS OF CONDUCT 

T here is a thnvmg band of quacks and charlatans who fre- 
quently go under the name of “applied psychologists,” but 
who really are pseudo-psychologists, traveling about from 
city to city, selling their wares m lecture halls, churches, lodges, 
and industrial plants, playing on the credulity of the gullible, work- 
ing on the enthusiasm inspired by hope, and claiming to teach 
how to “read” character from its physical signs They depend on 
the tendency of the uneducated to geneialize fiom specific cases. 
They rely on the compliance that is usually accoided one who 
speaks fair, pompous woids full of authority 
On the other hand theie is the group of self-elected St. Georges 
who champion the cause of scientific psychology against pseudo- 
psychology, laying about with lusty blows to repel the menaces of 
superstition and ignorance These defendeis of the faith, alas, do 
not write with the calm dispassionateness of scientists, for they are 
so outraged by the easily won populaiity of the pseudo-ps) chol- 
ogists as to become vitriolic in their denunciations, in fact they 
are as intolerant in their vituperations as those whom they assail 
are plausible in their inferences 

Fortunately there has also been a small gioup of level-headed 
thinkers who have been willing to cxpeiiment and determine the 
facts, who approach the problem without bias and are willing to 
see relationships when they have been proven to exist, but who 
also demand that the claims be given fair, impartial trial on a 
sufficient number of cases 

The interests of those who attempt to infer or predict conduct 
from external signs may be classified into the following groups: 

I. Events occurring outside the body 
a. Astrology. 

2. Characteristics of the body without even a remote relationship 
to conduct, 
a Palmistry, 
b. Phrenology. 


503 



504 Diagnosing Personality and Conduct 

3. Physique 

a. Height 

b. The morphologic index. 

c. Kretschmer’s types. 

4 Characteristics and stigmata 

a. Lombroso’s stigmata. 

b. Estimating character from physiognomy. 

i. Facial profile. 

c. Estimating character from photographs. 

d. Blonde-brunette types. 

e. Shape of hand 
5. Handwriting 

Even this list omits some of the most fantastic of character sys- 
tems, such as numerology, indology, and fortune-telling from 
playing-cards or tea-leaves. 

Events occuiring outside the body that have no possible rela- 
tion to one’s destiny may be dismissed immediately as un- 
worthy of our attention It seems strange that in this enlightened 
age masses of people not only in India but also in the western 
hemisphere, America included, still place their faith in astrology. 
At one time astrology so influenced the “scientists” of the day 
that such a word as lunatic even became a part of the language. 
But to-day we recognize that the planets and signs of the zodiac 
have not even the remotest influence in guiding human affairs 

With almost equal abruptness we may dismiss the two pseudo- 
scientific systems, palmistry and phrenology There is a certain 
type of mind which loves to make much of the trivial and to find 
in it occult portents. The significance of the lines of the hand, 
which in sober fact correspond to natural divisions between muscle 
groups, has been magnified until some have found in them a com- 
plete system of character analysis The famous anatomist Alex- 
ander MacAlister * says of this, 

“That these mechanical arrangements have any psychic, occult, 
or predictive meaning is a fantastic imagination, which seems to 
have a peculiar attraction for certain types of mind, and as there 
can be no fundamental hypothesis of correlation, its discussion 
does not he within the province of reason.” 

Phrenology is a system which undertakes to diagnose character 
by means of swellings or protuberances on different parts of the 
skull This system, first formulated by F. J Gall about 1800, was 

* Encyclopedia Britannica,^ i4l.h Edition, Vol. XVll, article “Palmistry.” 
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due to a false conception of the function of the brain It was 
assumed m a crude form at that time that the different sections of 
the brain are the seats of different mental “faculties” or “powers ” 
Consequently there was supposed to be a correspondence between 
the development of certain sections of the brain (which could be 
noted by studying the external contour of the skull) and certain 
faculties or personality characteristics of a person. Gall went to 
work empirically (on a few cases) to locate the seats of these 
faculties. Charts and diagrams have been developed mapping the 
location of such characteristics as love of approbation, cautious- 
ness, combativencss, destructiveness, the perception of time, tune, 
and number, and the like 

So far as can be shown, this system does not deserve the least 
amount of serious consideration In the first place, it has long since 
been demonstrated that the brain does not opciate functionally m 
any way that corresponds to the system of phrenology 

Indeed, the recent work of Lashley indicates how far we have 
left behind this view that there is a necessaiy localization of any 
complex function (such as the phrenologists always talk about) 
in any part of the brain, one part assuming the work and function 
of another part if necessary. The system of phrenology is also false 
m that the “mental faculties” on which the system is based have 
long since been shown not to exist Human behavior is a dynamic 
system of response to stimulation and is not an expression of 
certain faculties. MacAlister* likewise disposes of phienology as 
follows ** 

“Psychology, physiology, and expeiience alike contribute to dis- 
credit the practical woiking of the system and to show how woith- 
less the so-called diagnoses of chaiartei really aie Its application 
by those who are its votaries is seldom worse than amusing, but it 
is capable of doing positive social harm, as in its proposed applica- 
tion to the discrimination or selection of servants and other sub- 
ordinate officials ” 

Physique as Diagnostic of Conduct 

The systems which propose to diagnose personality by a super- 
ficial survey of a man’s physique are many. It is only natural that 

• Encycloptcdta Bntanmca, 14th edition, Vol XVII, article “Phrenology ” 

•• See also Paterson, D G , “Personality and Physique,” in T he MeastiTtment 
qI Man (University of Minnesota Press, 1930}. 
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there should be postulated a relationship beiween the superficial, 
outward characteristics of size and shape of the body and the 
expression of this body in conduct These claims must be given 
some serious consideration 

One of the most natural of such claims is that tall men have 
certain possibilities for aggressiveness or leadership not accoided 
to small men It has been claimed that tall men make more suc- 
cessful salesmen, that tall men are chosen to be leaders by the 
members of a group, and that height is an advantage whcrevei 
there is a call for dominance over otheis It is a common observa- 
tion that school superintendents when together m a group appear 
to be above average in height Yet to cite one example on the 
other side, Napoleon achieved prominence as a leader, although 
only a very short man. 

Several investigators have studied this asserted relation and 
provide us w'lth a check on it Gowm (27) obtained the heights 
and weights of 6,037 leading men such as governors of States, 
United States senators, mayors of leading cities, bishops, railroad 
presidents, etc. The average height of this group was 714 inches 
and the average weight was 18 1 i pounds These figures may be 
compared with 221,819 applicants for life insurance, whose aver- 
age height was 68 5 inches and average weight 166 pounds In 
Gowin’s study, those who weie in more responsible positions ex- 
ceeded those in less responsible positions in both height and 
weight. Kitson, on the other hand, in studying the relationship 
between success in salesmanship and height, found that the taller 
salesmen were no more successful than the shorter. Kitson says 
(37, P- 92). 

“Thus we see that the salesmen investigated are not appreciably 
taller than men in general While the extiemely tall salesman may 
in the long run possibly excel the extremely short salesman, he 
will not excel the salesman of average height Indeed, the figures 
show that he will not do so well as the man of medium height 
Accordingly, we should not be justified m looking to height as a 
physical criterion of success in salesmanship.” 

Bellingrath (3), in a study of school leaders as determined by 
office-holding in extra-curricular activities in high school, finds that 
the average height of seventy-four leaders was 67 7 inches against 
65.9 inches for those in the same classes who had not held office. 
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The leaders had an average weight of 140 pounds, while the non- 
leaders had an average weight of 129 pounds These differences 
were more pronounced in the girls than the boys. In the case of 
the girls, bisenal >’s of 44 in height and .42 in weight were 
found. 

We must conclude that height and weight are positively corre- 
lated with leadership and the accompanying characteiistics of 
leadership. But the ovei lapping is considerable and one would not 
be safe in picking candidates for positions of responsibility on the 
basis of height and weight alone. 

Two lines of investigation into the significance of bodily build 
have been followed One of these was pursued vigorously by 
Sancti Naccarati (48, 49, 50) under the caption of the “morpho- 
logic index.” The other centers in and is largely due to the stimu- 
lating suggestions of the German psychiatrist Kretschmer (42) 
concerning the relations of physique and character. 

Naccarati entered upon his problem with a background of 
knowledge of the work of the Italian anthropologists, particularly 
Viola. Naccarati recognizes two extreme morphologic or structural 
types. (Morphology is a biological science dealing with the form 
and structure of plants and animals ) M icros plane hiucs are indi- 
viduals with small trunks as compared with the development of 
their limbs. These are the persons who strike one as being long 
and thin. The macrosplanchmcs, on the other hand, are those 
with large trunks as compared with the development of their limbs. 
They impress one as being stout or fat Naturally it is recognized 
that these aie merely the extremes of a continuous gradation and 
that the largest group of men are no nnos plane hnics. Naccarati 
believes that these two extreme types represent fundamental dif- 
ferences in human temperament. The macrosplanchmcs represent 
those in whom the nutiitional and V 5 getative systems, in which 
energy is stored up, have had the most extensive development 
The microsplanchnics represent those in whom the systems of 
locomotion and manipulation by which energy is transformed and 
utilized are in the ascendancy. He believes that this latter type is 
correlated with a highei mental organization Physiology identifies 
the microsplanchnic type with hyperthyroidism. 

Naccarati developed a “morphologic index” for accurately meas- 
uring these differences m body types. The somewhat complex 
formula for computing the morphologic index is as follows 
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Moiphologic index (M 


length of two limbs 
volume of trunk 


More in detail the computation is carried out as follows (68, 
p. 448) : 

I. Sternum length 

2 Xipho-epigastric line. 

3 Pubo-epigastnc line. 

4 Transverse thoracic diameter (width of upper chest). 

5 Anterior-posteiior thoracic diameter (depth of upper chest). 

6 Transverse epigastric diameter (width of upper chest) 

7 Anterior-posterior epigastric diameter (depth of lower chest). 

8 Transverse pubic diameter (width of waist). 

9. Length of lower extremity. 

10 Length of upper extremity. 

It. Height. 

12. Weight. 

a. 1X4X5= thoracic index (volume) 

b. 2X6X7 = index of upper abdomen (volume) 
c 3X7X8= index of lower abdomen (volume) 
d a -|- b -f- c = trunk-value (volume) 

e 9 -f- 10 = limbs value (combined length) 

{ e — d = M. I (morphologic index) 

The larger the M I , the greater the tendency toward the micro- 
splanchnic type 

With all of this set-up and m spite of the refinements of these 
measurements, the results have been disappointing Naccarati 
himself found a correlation of 35, with seventy-five Columbia 
College students, between the AI I and the Thorndike “Intelli- 
gence Test for High School Graduates ” Sheldon (68), however, 
repeating the experiment in Chicago, found a correlation of only 
.14 between the M I. and the American Council Intelligence Test. 

Heidbreder (29), in investigations on college students, finds a 
correlation of only 03 ± .03 between the height-weight ratio and 
score on the “Minnesota College Ability Tests” for 500 men, and 
.04 ± .03 for 500 women. 

Naccarati and Garrett (49) found practically no relationship — 
and were later substantiated in this by Garrett and Kellogg (23) — 
between the M 1 . and the Woodworth personal data sheets, the 
Pressey X -0 tests, or ratings. In short, they were not able to 
demonstrate that there is any connection between the M. 1 . and 
temperamental qiialitiefj^ 
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In this connection work done by Faterson under the direction of 
Ileidbieder and Paterson at Minnesota indicates the probability 
of a slight relationship between physique and temperament Usmg 
the Heidbreder “Inferiority Attitude Self-Rating Scale,” a correla- 
tion of -f- .03 zt 03 was found for 673 college freshmen be- 
tween the height-weight ratio and the inferiority attitude score. 
Using 531 freshmen, a correlation of -(- ii ±.03 was obtained, 
the correlation of inferiority-superiority with height being 
-|- .10 ± .03 and with weight — 10 ± 03 As Paterson (55) sums 
up, “Apparently there is a slight tendency for those who are tall 
and thin to rate themselves higher on the scale,” i e , toward feel- 
ings of inferiority. 

Sheldon (69), however, found that the M 1 . correlates — .22 
with ratings on sociability and — .14 with ratings on leadership. 
Sheldon concludes from his experimental survey of the field that 
“the factor of general size, or bigness, seems to be related posi- 
tively to sociability, leadership, and aggressiveness,” and his evi- 
dence show's the possibility that a slight relationship may exist. 

Kretschmer (42) worked on practically the same problem from 
the psychiatric point of view, but he developed a new terminology 
which requires that his work be treated separately Kretschmer 
recognizes four types of individual, called respectively the asthenic, 
the athletic, the pyknic, and the dysplastic forms These forms are 
to be distinguished from each other by their bodily build, by the 
muscular development and placement of fat on the body, by the 
shape of the face and hands, and by the character of the skin and 
hair. These four types are succinctly characterized by Mohr and 
Gundlach as follows (47, pp. 118, ii9)' 

“The asthenic individual, said to be of the schizophrenic tem- 
perament, IS one who is of average height but is relatively tall for 
his weight. He is thin, with a long, narrow, shallow chest His 
shoulders are relatively broad contrasted with the diameter of his 
chest His muscles are thin and poorly developed, his skeletal 
structure is slight. The skin is thin and loosely attached to the 
underlying tissues. The face is characteristically long and narrow, 
with a prominent nose and clear-cut features. The facial angle is 
sharp, and the mid-face is relatively long. 

“The athletic physique is similar to the asthenic in general 
bodily proportions but all of the structures are thicker, firmer and 
of more robust development The shoulders are heavy, the chest is 
broad and of medium depth. The skeleton is heavily built. The 
muscles are thick, of good tone and are well contoured The skin 
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is thick and closely adherent. The face is relatively long and nar- 
row with proportions similar to the asthenic but with thick though 
well-defined features The facial angle is less maiked than in the 
asthenic and the lower jaw more heavily developed The athletic 
build is considered a variant of the asthenic 

“The pyknK habitus is described as one in which there is an 
increase m the volume of all of the body cavities. The head is 
large, the chest is voluminous and exceptionally broad and deep 
Although the shoulders are of moderate width they appear narrow 
contrasted with the broad chest The abdomen is full The skeletal 
structure is slight when compared with the general bulk of the 
individual and the extremities are relatively small and slender 
The hands are small and delicate Theie is a generous adiposity 
and the skin is thick and firm The face is round and the midface 
IS short. The complexion is luddy. 

“The dysplasttc type includes many deviants from the normal 
In this group are those physical forms evidencing disturbance of 
the various ductless glands The elongated form of the eunuchoid, 
hypoplastic forms and those in which there have been localized 
developmental disturbances, are included.” 


Kretschmer believes that there is a marked relationship between 
these bodily types and temperamental qualities In his volume, 
Physique and Character, he draws with the enthusiasm of the 
clinical worker a clear-cut picture implying a degree of relation- 
ship higher than it really is. 

His studies with the insane have led him to the view that there 
are two general types of functional insanity These two types also 
exist in minor degree in normal persons, as shown by what he calls 
the cyclothymic and schizophrenic temperaments. 

The cycloids (having the cyclothymic temperament) include 
those who wear their emotions on their sleeves They are ex- 
tremely expressive and tend to fluctuate in mood from one of 
joyful excitation to extreme depression The cycloid as we know 
him in daily contacts is the sociable, good-natured, friendly, genial, 
hail-fellow-well-met sort of an individual. He takes life as it comes, 
gets on well with people, and is ready to understand a joke. On 
some occasions he is cheerful, jolly, hasty, while at other times he 
is quiet, calm, soft-hearted, and easily depressed. The fat boy in 
Pickwick Papers and the irrepressible Mr Micawber in David 
Copperfield are examples of the type. Naturally one finds them 
with all degrees of variation. 

The schizoid (having the schizophrenic temperament), on the 
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other hand, is an unsociable, quiet, reserved, serious, eccentric 
individual His emotional life is hidden within There is a barrier 
between him and society. He is the man that no one knows or 
understands Only at rare intervals does the inner man break 
through, revealing the depth of feeling hidden nithin him On 
occasion he will be timid, shy, sensitive, showing fine feelings, 
nervous, excitable, fond of nature and books Again he will be 
found to be stolid, pliable, kindly, honest, indifferent, dull-witted, 
silent. 

Kretschmer believes there is a marked relationship between 
these two types and diffeiences of physique The asthenic and 
athletic types of bodily build go with the schizoid temperament. 
The tall, thin man tends to be the reserved, sensitive, unemotional 
individual. On the other hand, the pyknic types go with cycloid 
temperaments The fat, well-developed person is also the one who 
IS genial, friendly, and easy to meet Kretschmer gives the follow- 
ing table to show the extent of the relationship* 

Table 85 

Relation between Temperament vl Types and Physique 
(from Kretschmer, 42, p 35) 


Physique 

Cycloid 

Schizoid 

Asthenic 

4 

81 

Athletic 

3 

31 

Asthenic-athletic mixed 

z 

11 

Pyknic 

S8 

2 

Pyknic mixture 

14 

3 

Dysplastic 

, 

34 

Deformed and uncataloguable forms 

4 

13 


85 

17s 


Kretschmer’s claims have received confirmation in Europe and 
America. 

Wertheimer and Hesketh (79), working in Baltimore hospitals, 
present confirmation of Kretschmer’s hypothesis Eleven cases of 
manic-depressive insanity had an index of 233 (a low score indi- 
cating the stout build), whereas twenty-three schizophrenics had 
an average of 281 (indicating a tendency toward longness and 
thinness), but there was much overlapping From the table of 
their results it is evident that there is a close relation between 
the tendency toward the manic-depres^sive psychosis and the 
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pyknic body build, but that the schizophrenic cases although tend- 
ing toward asthenic or athletic types are not so closely confined to 
one type. 

Tvble 86 

Showing the Percentage Distribition of M\Nic-DEPRcssnxs and Schizo- 
phrenics According to Kretschmer’s Body Types 

(after Wertheimer and Hesketh) 


CLEAR VANIC-DEPRESSIYES Cl EAR SCHIZOPHRENICS 



N = 11. 

N = 6 

N = 23 

N= 13 

Body type 

All ages 

Age 2Q -f 

All ages 

Age 30 + 

Pyknic . . . 

• 4SS 

666 

4 i 

83 

Pyknoid 

364 

33 3 

130 

25 0 

Athletic 

90 

0 

26 I 

167 

Asthenic-athletic mixed . 

. 0 

0 

348 

25 0 

Asthenic 

0 

0 

174 

167 

Nuclear 

90 

0 

43 

83 


99 9 

99 9 

99 9 

99 9 


Mohr and Gundlach’s (47) study concerned primarily the per- 
formance of criminals who had been classified according to 
Kretschmer’s types, and they obtained social data indicative of 
the temperament and personality of the groups. Nineteen asthenic 
criminals as compared with forty-four pyknic criminals showed a 
larger percentage of crimes against property and a small percent- 
age of crimes against the peison The asthenic group had a much 
lower record of previous imprisonment. A smaller percentage of 
the asthenic group were married and a smaller percentage be- 
longed to fraternal organizations. These groups differed in race, 
age, and social status, which may impair to some extent the clear- 
cut character of the findings All of these findings, however, are in 
harmony with Kretschmer’s theories regarding the temperamental 
differences between the two groups. 

Farr (20), working in the Pennsylvania Hospital, reports 
studies of the relationships between anthropometric measurements 
and types of psychosis. Very careful measurements were made of 
a group of sixty patientSj including eleven characterized as 
schizoid and sixteen as affective. Farr concludes from his studies 
(20, pp. 23s, 236): 

“The figures suggest a rather definite association of secluslve 
and schizoidal personalities with the slender, relatively elongated 
types — often with dysplastic features— and of the affective per- 
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sonalities with intermediate or definitely thick-set physiques This 
is entirely in agreement with the observation of others, but the 
outstanding exceptions and the questionable correlations are so 
numerous that anthropometry must be looked upon rather as 
interesting and suggestive than as diagnostic.” 

One of the difficulties with Kretschmer’s system is that he has 
given only a descriptive characterization of the types; he has not 
defined them rigorously in quantitative terms. With this loose kind 
of classification instead of measurement to go by, investigators 
have formulated widely different ciiteria of the various types or 
have placed different amounts of emphasis on the criteria 
(physique, face, hair, skin, etc ), so that there is wide vai lability in 
the incidence of the types among insane patients studied by differ- 
ent investigators. Some have found it difficult to distinguish be- 
tween the athletic and asthenic groups. Kretschmer in recent 
writings would combine these two groups into a single leptosovie 
type, but hdohr and Gundlach do not believe this is justi- 
fiable. 

Although Kretschmer used measurements to describe his types 
after he had selected them, he apparently is not willing to advo- 
cate making the original selection on the basis of measurements. 
He says (42, p. 13): 

“It is superfluous [to procure statistical values for every single 
measurement of the body] because even within a single people we 
can give only approximate figures, since, when dealing with body- 
forms, we have not to do with cleaily maiked unities but with 
ill-defined types, where, in the case of certain border-line cases, it 
depends on the investigator himself whether he includes them on 
the list of the one single type or not ” [And in another place] “The 
important idea about a type is that it possesses a firm center but 
not hard and fast boundaries Types as a rule can only be deter- 
mined intrinsically, we cannot maik their boundaries By ‘type’ 
we mean a nucleus of more distinct and among themselves quite 
firm formations which have been deliberately lifted out from a sea 
of progressive transitions. This holds good for a racial type as 
well as a personality or a clinical reaction type.” 

The breakdown of Kretschmer’s classification system is due to 
his attempt to approach the problem through distinctions of types. 
Types even when they do not violate logic are not reconcilable 
with statistics and the fundamental principles of measurement. 
The truth of the matter is that all of the qualities which go to 
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characterize a type can be measured on a continuous scale of more 
or less It is piobable that all of them would form distributions 
approximating the normal probability curve, with few at the 
extremes and many near the median Each of these qualities (body 
dimensions, facial contour, skin texture, hair) has more or less 
correlation with temperamental differences. But they are also 
more or less imperfectly correlated among themselves, so that it is 
seldom that an individual illustrates a pure type The problem of 
using physique in the diagnosis of conduct thus becomes a statisti- 
cal problem of correlation The task of isolating the significant 
variables and combining them with the best weighting for the 
diagnosis of temperamental differences is a problem yet to be 
worked out 

Enough has been done in this field to show that there are certain 
relationships between physique on the one hand and personality 
on the other that have diagnostic value The clinical experience of 
such psychiatrists as Kretschmer has uncovered promising leads 
But the scientific work of determining the exact relationships and 
making use of these relationships to fashion a measure of diag- 
nostic significance yet remains to be done Even when it is done, 
it cannot be used as a complete measure of temperamental differ- 
ences, for there will still remain such factors as age, environmental 
stress and strain, etc , to accentuate or reduce the clear-cut tem- 
peramental manifestations 

Bodily Characteristics and Stigmata as Diagnostic 
of Conduct 

Our discussion cannot come to an end when we have taken up 
general physique only, for every conceivable characteristic of the 
human body has at one time or another been believed to be indi- 
cative of tendencies in conduct Many of these imaginary relation- 
ships are so absurd as to be unworthy of our attention, yet they 
have gained the support of adherents of prominent schools of 
thought as well as of quacks and charlatans. 

One famous theory which we may appropriately use to intro- 
duce this section is the work of the school of Italian criminologists 
headed by Lombroso Lombroso, who devoted his life to the study 
of the criminal, tells us as follows how he first discovered that 
there is a criminal type: 
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“In 1870 I was carrying on for several months researches in the 
prisons and asylums of Pavia upon cadavers and living persons, in 
order to determine upon substantial differences between the insane 
and criminals without succeeding very well Suddenly the morning 
of a gloomy day in December, I found in the skull of a brigand 
a very long series of atavistic anomalies, above all an enormous 
middle occipital fossa, and a hypertrophy of the vermis, analogous 
to those that are found in inferior invertebrates At the sight of 
these strange anomalies, as a large plain appears under an in- 
flamed horizon, the problem of the nature and of the origin of the 
criminal seemed to me resolved, the characters of primitive men 
and of inferior criminals must be reproduced in our time.” * 

This discovery in a single individual led to the rearing of a com- 
plete theory that criminals could be identified by certain “stig- 
mata,” said to be atavistic in nature This theory has been carried 
to extreme lengths. Criminal characteristics, we are told (26) by 
the Lombroso school, have dark and thick hair, sometimes woolly 
in texture; a skull which may vary in five directions* (a) it may 
be rounded like a dome, or (b) depressed like a roof that is flat 
or low, or (c) its vault may be keel-shaped, or (d) it may be 
bulging, with a protuberance on one side, or both sides, or in front, 
or behind, or (e) it may have a sugar-loaf appearance, eyebrows 
that are beady or scanty, a nose that is defective and is fre- 
quently without a bony skeleton, ears that are long and thick; 
skin that IS pale and wrinkled, lips that are cleft; teeth with the 
molars undeveloped, wisdom teeth absent, the canine teeth over- 
developed, etc. 

“Finally, to select from a list of remaining characteristics, we 
must add that, according to various authorities, the male criminal 
has often the bust of a female and the female criminal the beard 
of a man, and that both male and female suffer from infantilism; 
that the criminal has an ape-likc agility and a prehensile foot, that 
he IS left-handed and ambidextrous, with his right hand smaller 
than his left, and his left foot smaller than his right, that he stam- 
mers and squints; that he sleeps soundly, tattoos his body, is given 
to the early use of tobacco, is sensitive to the weather, and is 
seldom seen to blush !” 

Goring ( 26) in an extensive study in which he reports the results 
of anthropological measurements on 3,000 English criminals, sub- 

•From a speech made by Lombroso at the Congress of Criminal Anthro- 
pology, held at Turin in 1906. , 
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jected many of the Important claims of the Lombroso school to the 
test of careful measurement and statistical treatment After finding 
the correlation between vanous measurements of the head, face 
and other physical characters and the nature of the crime, he 
concludes : 

“It will be seen that ten only of the thirty-seven characteristics 
have correlations with nature of crime greater than i, and that 
the correlations of the remaining twenty-seven are either insignifi- 
cant, relatively to their probable errors, or so small m value as to 
be legitimately ignored in such limited samples as those we have 
been examining Of the ten above i in value, three only are above 
.2, and only one above 3 in value With the exception of these ten, 
which will require more detailed investigation, we may say that 
these physical characters have no significant association with the 
nature of the crime committed In other words, we conclude that 
if there be any real association between physical characteristics 
and crime, this is so microscopic in amount as not to be revealed 
by the values of our correlation ratios and coefficients of con- 
tingency.” 

These ten characters which show correlation were studied with 
reference to their relation to other selective factors His inquiry led 
him to state, “Physical differences exist between different kinds of 
criminals, precisely as they exist between different kinds of law- 
abiding people. But, when allowance is made for a certain range 
of probable variation, and when they are reduced to a common 
standard of age, stature, intelligence and class, etc, these differ- 
ences tend entirely to disappear.” Finally he compared the physi- 
cal characteristics of criminals with those of non-criminals and 
decided, “No evidence has emerged confirming the existence of a 
physical criminal type, such as Lombroso and his disciples have 
described.” “There is no such thing as a physical criminal type.” 

Similar systems of character analysis based upon the superficial 
observation of external characteristics have been advocated for 
use in industry and vocational guidance. One thoroughly worked- 
out system is that sponsored by Katherine Blackford (8) Her 
book on The Right Job — How to Choose, Prepare for, and Suc- 
ceed in It, is most intriguing. Its appeal to the semi-educated man 
must be overwhelming. Here are diagrams and pictures showing 
in the clearest outline the significance of the shape of the head, the 
face and the facial profile, the shape, size and flexibility of the 
hand, bodily proportions, complexion, etc, The captions under the 
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pictures are also disarmingly convincing (8, p. 35): “Example of 
high, narrow head, showing ambition, keen intelligence, and in- 
tuition, with humor, spirit, sympathy, and affection” (8, p 13 1 ) 
“Professional type, with indications of mechanical and inventive 
ability Note blonde color, height and width of forehead, especially 
in upper middle, with squareness of jaws and width between 
eyes.” * 

One may pick out absurd generalizations on every page (8, p 
193). “The person who finds the applause of others a heavy stim- 
ulant has a short upper lip ” (8, p 166 ) “The organizer must 
have a ‘high head,’ both at the crown and also at the top or that 
portion which is just back of the hairline and above the tem- 
ples ” ** and so on 

There is a kind of persuasiveness in much of the kind of reason- 
ing used by those who claim to be able to read character from 
physiognomy If the experiences of life leave their traces in the 
nervous system so as to effect future behavior, it is plausible that 
traces of experience are left m facial expression It is well known 
that the emotions tend to heighten all muscular activity and result 
in various muscular tensions So a life of strain leads to tensions 
around the mouth or in the furrow between the eyes that even- 
tually should result in a set expiession. Fatigue also leads one to 
try to relieve the fatigued parts by extra effort in unused muscles, 
and consequently the strain of fatigue is often imparted to facial 
expression 

The arguments underlying shape or contour of head or face are 
obscure. Most of them depend on analogies to function that have 
little basis in fact. 

Several of the claims commonly put forward by these character 
readers have been put to the experimental test One of the most 
ambitious of these is reported by Cleeton and Knight (12). Their 
work was carried out by obtaining physical measurements and 
character ratings on twenty-eight college students, members of 
two sororities and a fraternity Eight character traits were selected 
for study — sound judgment, intellectual capacity, frankness, will- 
power, ability to make friends, leadership, originality, and impul- 

•From Blackford, Katherine M H, and Newcomb, A, The Right Job — How 
to Choose, Prepare for, and Succeed in It, pp 3S and 13 1 By permiision of 
Katherine M H Blackford 

•• Ibid , pp 193 and 166 By permission of Katherine M H Blackford, 
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siveness,, these being traits upon which the phrenologists agree best 
in their systems of diagnosis A list of the physical alLiibutes com- 
•monly alleged to be diagnostic of each of these traits was as- 
sembled. There were twenty-eight physical characteristics for 
judgment, twenty-nine for intelligence, thirty-six for will-power, 
thirteen for ability to make friends, etc Accurate measurements 
with calipers, tapes, head square, and other instruments were 
made for each of the 201 physical characteristics mentioned 

The second part of the expeiiment consisted in obtaining ratings 
on the character traits Each of the members of three fraternities 
rated all of the other membeis of his fraternity on each of the 
eight traits These ratings, known as the ratings of “close asso- 
ciates,” are shown to be highly reliable Ratings by “casual ob- 
servers” were also obtained by seating the subjects on a platform 
where they were rated by seventy judges — business men, school 
superintendents, and students of personnel management These 
judges were accustomed to size people up in selecting for employ- 
ment, but claimed they did not use any system These ratings by 
the casual observeis were also highly reliable, but they correlated 
low with the ratings by the close associates 

Finally, each of the ratings was correlated with each of the 
physical measurements chosen as being symptomatic of a given 
trait These correlations as averaged are presented in the following 
table: 


Table 87 

CoRHELATION BETWEEN RATINGS OF CHARACTER FACTORS AND PHYSIOGNOMIC 
Measurements Alleged to Be Symftomattc of These Factors 


(from Cleeton and Knight, 12, p 229) 



Ratings of 

Ratings of 

Ratings of 


close associ- 

casual ob- 

close associ- 


ates and 

servers and 

ates and 


physical mca- 

physical mea- 

those of casual 

Traits 

sures 

sures 

observers 

Judgment 

. — 01 

14 

32 

Intelligence . . 

03 

.05 

.02 

Frankness 

05 

•15 

.21 

Friendliness . . 

. . — II 

19 

.18 

Will-power 

... — 07 

04 

26 

Leadership . . 

— 04 

.07 

•31 

Originality 

09 

08 

•32 

Impulsiveness 


-07 

.20 
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The investigators conclude that “physical factors purporting to 
measuie the same tiait do not present even a suspicion of agree- 
ment”, that “the correlation between latings of casual observers 
and physical measurements is best represented by o ooo”, that 
“physical measurements which underlie character analysis agree 
neither with themselves nor with other measures of character.” 
(12, p. 230.) 

These sweeping conclusions do not seem entirely justified by 
the correlations reported It is quite possible that the average of 
the twenty-eight physical measurements alleged to indicate judg- 
ment failed to do so, but that some one or more of the measures 
may have some significance. There may perhaps be a kernel of 
tiuth in some of the physiognomist’s assertions. For instance, in 
this study the correlation between ratings on judgment by close 
associates and the ratio of anterior length of head to posterior 
length of head is .29 ± .12, the ratio of anterior inferior length of 
head to posterior central length is 38 ± .12 To be sure, these are 
within the possible range of chance deviations from a zero correla- 
tion, but they are at least suggestive of a possible relationship. 
Cleeton and Knight’s study, although it denies the possibility of 
any widespread truth in the assertions of character readers, does 
not destroy the possibility that out of the welter of claims there 
may be established some physical indicia of value in the diagnosis 
of character. It is deplorable that these investigators did not report 
all of their results in detail instead of assuming that the averages 
tell the whole story. 

Among the claims advanced by the Blackford school of char- 
acter reading is one concerning the significance of the facial profile. 
The convex type with prominent nose, sloping forehead, and 
receding chin is said to indicate a person possessing “superabun- 
dance of energy,” and one who is “keen, alert, quick, eager, ag- 
gressive, impatient, positive, and penetrating ” The concave 
type of profile with prominent chin and high forehead indicates a 
person who is characterized by mildness and who is “slow of 
thought, slow of action, patient in disposition, plodding.” Alice L. 
Evans (32, 19), under direction of Clark Hull, investigated the 
character significance of the profile, using a specially devised meas- 
uring instrument for obtaining an accurate index of facial convex- 
ity. Twenty-five university women, members of the same sorority, 
rated each other on optimism, activity, ambition, will-power, 
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domination, popularity, blondncss These ratings were highly re- 
liable with the exception of those of popularity The significant 
relationships in the study aie shown in the following table 

Tviile 88 

Correlations between Measurements of Facial Profile and Personality 

Ratings * 

(from Evans and Hull, 32, p 130) 


Optimism Activity Ambition 

IVill- 

power 

Domi- 

nation 

Popu- 

larity 

Convexity — whole face 

with nose 

+ 10 

— OS 

— 17 

— 13 

— II 

— 03 

Convexity — chin to eye- 
brow without nose 

+ 37 

+ 39 

+ 33 

+ 34 

+ 24 

+ 17 

Height of forehead from 
eyebrow to hairline 

— 17 

— 29 

— 23 

— 39 

— 22 

— 10 


Hull’s conclusions from Evans’ work are helpful (32, p. 129) . 

“For the most part, convexity also seems unrelated to the 
various character traits A possible exception to this seems to 
be convexity of the lower face (chin to eyebrow without nose), 
which yields a curiously consistent senes of relatively large posi- 
tive correlations with all of the character traits. Because of the 
small number of subjects the probable error of these coefficients 
IS rather large, but the correlations are nevertheless suggestive. 
A second possible exception lies in the series of negative corre- 
lations between the various traits and the height of the fore- 
head from eyebrow to hairline If these coefficients may be taken 
at face value, they indicate that low foieheads tend to indicate 
optimism, activity, will, etc , but here again we must reserve 
judgment because of the small number of subjects. These co- 
efficients are sufficiently suggestive to warrant further investi- 
gations.” ** 

The possibility that some of these relationships may be greater 
than chance is tantalizing. If they should so prove it might be 
surmised that the judges unwittingly employed these physi- 
ognomic characteristics as a basis for their judgments. There is 
a strong presumption, however, for believing that the relationship 
between objective measures of conduct and physiognomic traits 
IS purely a chance relationship. 

•From Hull’s Aptitude Testing Copyright 1928 by World Book Company, 
Publishers, Yonkers-on-Hudson, New York 

••Ibid 
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Estimating Character from Photographs 

It seems probable that, if it is not possible to estimate char- 
acter from direct face-to-face observation on the basis of physi- 
ognomic characters, there is little possibility that one can judge 
character from photographs. However, there are systems of char- 
acter judgment based on photographs, and their claims must be 
considered. Two studies have been made on the relationship of 
character traits and judgments of photographs, one by Miss 
Cogan, working under the direction of H. L. Hollingworth (30), 
and the other by Miss McCabe, working under Clark Hull (32, 
4S). In each case a group of college women rated each other 
on character traits Since each group was composed of members 
of a sorority, the subjects knew each other intimately, and their 
judgments of character differences were as accurate as could be 
obtained under any circumstances The combined ratings had 
very high reliability A second squad of judges, unacquainted 
with the first, rated photographs of the subjects for the same 
character traits as those used by the subject groups. The cor- 
relations between the ratings made by intimate associates on the 
basis of acquaintance and those made from examination of photo- 
graphs are given m the following table- 

Table 89 

Correlation between Real Character Traits lnd Judgments of Character 
Based on Photographs * 

(from Hull) 


Trait 

McCabe's results 

Cogan’s results 

Neatness 

+ 07 

+ 05 

Conceit 

+ 21 

-f 19 

Sociability 

. + 12 

+ 29 

Humor . 

— 07 

+ 33 

Likability 

+ IS 

+ 38 

Intelligence 

-f 40 

+ SI 

Refinement 

+ 17 

+ S 1 

Beauty 

-t-61 

+ SS 

Snobbishness 

. . + 32 

+ S6 1 

Vulgarity 

... +10 

+ 6S 


McCabe’s results are the more dependable since they are 
based on a larger number of cases (forty) Only two of the cor- 

*From Hull’s A-ptitude Testing Copyright 1928 by World Book Company, 
Publishers, Yonkers-on-Hudson, New York 
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relations are of any size — ^those for beauty (+ 6i) and for intel- 
ligence (-f- 40) We would expect that beauty could be judged 
from a photograph, for beauty is “only skin deep.” As for the 
correlation of 40 with intelligence, this disappears when checked 
up against objective evidence of intellectual ability It is proba- 
ble that the relationship of 40 holds because all the judges used 
certain features such as seriousness, wearing glasses, a bright eye, 
etc , as aids in estimating intelligence from the photographs. 
Estimations of character from a photograph are worthless both 
because a single judgment is very unreliable and also because 
there is probably no relationship between physiognomic features 
and character traits 

Blond-Brunette Types 

Another of Blackford’s claims that has been investigated ex- 
perimentally concerns blond and brunette types There is a wide- 
spread belief that blonds and brunettes possess certain well-de- 
fined characteristics Blackford has been more definite than 
others, as the following description shows (7, p. 144) . 

“In brief, always and everywhere, the normal blond has posi- 
tive, dynamic, driving, aggressive, domineering, impatient, active, 
quick, hopeful, speculative, changeable, and variety-loving char- 
acteristics, while the normal brunette has negative, static, con- 
servative, imitative, submissive, cautious, painstaking, patient, 
plodding, slow, deliberate, serious, thoughtful, specializing char- 
acteristics ” 

These descriptions certainly challenge inquiry, and the honors 
for a critical and clear-cut investigation go to Paterson and Lud- 
gate (52) The twenty-six characteristics mentioned by Black- 
ford were listed in random order on a rating sheet. The directions 
for rating follow 

“Select from among all the people you know very well two 
who are pronounced blonds and two who are pronounced bru- 
nettes, and rate them with respect to the characteristics listed 
below. Put either a plus (-|-) or a minus ( — ) sign after each 
characteristic for each of the four persons you select for rating 
If your first blond is positive, put a plus sign after that charac 
teristic,_if not, put a minus sign and so on for the rest of the char- 
acteristics.” 
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Ninety-four college students acted as judges. If Blackford’s 
claims are correct, there should be a preponderance of plus 
signs for blonds and negative signs for brunettes for the trait 
“positive,” and so on down thiough the list As a matter of 
fact the differences between blonds and brunettes are remark- 
ably small, as is shown in the accompanying table. 

Table 90 

Percentages of Bionds and Brunettes Rated as Possessing Certain 
Personalitt Traits 

(from Paterson and Ludgate, $2, p 125) 

percentage rated plus 


Blond traits 

1S7 blonds 

187 brunettes 

Positive 

81 

84 

Dynamic 

63 

64 

Driving . .... 

49 

SO 

Aggressive 

62 

S6 

Domineering , 

36 

36 

Impatient 

S6 

SI 

Active 

88 

82 

Quick 

70 

68 

Hopeful ... 

8S 

8S 

Speculative . . . 

S3 

SI 

Changeable . . . 

S3 

43 

Variety-loving 

66 

62 


percentage rated plus 


Brunette traits 

187 blonds 

187 brunettes 

Negative 

. 16 

17 

Static 

28 

31 

Conservative . . . 

51 

61 

Imitative . . 

39 

40 

Submissive 

ZS 

26 

Cautious 

S4 

60 

Painstaking . 

S6 

61 

Patient 

43 

52 

Plodding 

Z7 

31 

Slow 

. 20 

24 

Deliberate 

47 

57 

Serious 

S8 

72 

Thoughtful 

67 

70 

Specializing 

SZ 

4 S 

These results show not only 

that there are 

no differences be- 

tween blonds and brunettes of 

the kind claimed, but that there 

is not even generally prevalent 

an organized or systematic belief 


concerning differences between blonds and brunettes which might 
have influenced the ratings. 
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Shape of the Hand 

Still another school of character readers professes to be able 
to use the shape and size of the hand for describing character 
traits. As with the other systems there is just enough analogy be- 
tween the use of the hands in different occupations and the shape 
best adapted to this use to make the claims seem plausible We 
read that the longer the first finger is when compared to the 
second, the more ambitious a person is, the farther a person 
can bend his fingers backward, the keener his mind, the longer 
the fingers in proportion to the length of the palm, the stronger 
the tendency to impulsiveness These claims are so absurd as to 
be beneath the dignity of serious consideration, but since they 
attract the attention of business men and others who must deal 
with personnel problems, their emptiness should be revealed 

Fortunately Miss MacLaurin (32. 44), another of Clark Hull’s 
students, has carried on an investigation to determine the facts. 
Again thiity members of a sorority rated each other on character 
traits mentioned m the literature of chirognomy , and the experi- 
menter made careful measurements of their hands. The following 
table illustrates the findings: 

Table 91 

COKKELATIONS BETWEEN CERTAIN MEASURED PROPORTIONS OF THE HaND AND 

THE Chjulvcter Traits Alleced to Be Indicated by Each 
(from Hull *) 

Hand trait Character trait Correlation P E 

Difference in height between first and 
second finger Ambition . + 19 ± 12 

Extent of backward fieiion of the hand, 
measured in degrees Keenness of mind 13 ± 12 

Degree of taper of fingers Refined sensibility + 16 ± 12 

Length of fingers divided by length of 
palm . Impulsiveness . . + 29 ± n 

Length of thumb divided by length of 
palm and second finger .. . Force of character — 23 ± 12 

Here again the correlations are suggestive, but the probable 
errors are so large as to prevent one from concluding that any- 
thing but a chance relationship exists. 

•From Hull’s Aptitude Testing Copyright 1928 by World Book Company. 
Publishers, Yonkers-on-Hudson, New York. 
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Handwriting 

The last in our list of external signs used as a basis for judg- 
ing character is handwriting Handwriting differs from the pre- 
ceding signs In being a product of activity rather than a char- 
acteristic of the body itself It is quite conceivable that one’s 
temperamental qualities should express themselves in handwrit- 
ing, and handwriting is fortunately left as a permanent record 
to be studied, facilitating methods by which this hypothesis can 
be tested Many persons have been fascinated by the hypothesis 
that handwriting reveals character Observation of handwriting 
is to-day one of the common methods used to obtain a quick and 
easy diagnosis of character It must be confessed that certain 
persons actually have acquired a skill in the art of reading hand- 
writing that enables them to diagnose traits with an accuracy 
greater than the expectations of chance But this art will not be 
generally useful until it can be codified into definite rules which 
will stand under the scrutiny of experimental investigation 

Clark Hull (33), who has conducted one such investigation, 
has reviewed for us some of the history of character reading 
through handwriting Camillo Baldo published a treatise on the 
subject m 1662, and the matter attracted the attention of such 
eminent men as Leibnitz, Goethe, and Sir Walter Scott An ex- 
tensive body of literature has grown up on the subject That 
men like William Preyer (58), professor of physiology at Jena, 
and George Schneidemuhl (65), professor of comparative pa- 
thology at the University of Kiel, are known as graphologists 
shows the extent to which the “science” of reading handwriting 
has won a recognized place in academic circles. Most of the 
graphologists, however, build systems and make claims without 
bunging them under experimental scrutiny. 

One of the most elaborate systems of graphology is that of Rob- 
cit Saudek (63), a man who has made an intensive study of hand- 
writing but has mixed his physiological studies with vague specu- 
lations. This system resolves itself into such generalizations as: 

“Predominance of the lower projection points to a precise, 
healthy, natural functioning of the muscular tonus, and often, in 
addition, a practical mind and commercial sense” (p 298). 

“The lack of vertical readjustments of the writing surface 
points to indolence and lack of adaptability” (p. 294). 
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In describing his method of detecting dishonesty, Saudek men- 
tions ten characteristics of handwriting Number 10 illustrates the 
method. 

“10 The letters o, a, d, g, q are open at the base and are there- 
fore (with the exception of the o) written in at least two strokes, 
with a clockwise movement” [He adds] “We know that this 
feature is the only reliable feature (as regards the formation of 
the letters) of the French school of graphology, and also that in 
the files of the police it occurs in 30 per cent of the signatures 
alone of habitual thieves of both sexes” (p 286). 

Evidence presented by Saudek is quite unconvincing, even In 
its fundamental assumptions, betraying a lack of knowledge of 
the ways in which character is organized For instance, in demon- 
strating his ability to detect dishonesty from handwriting, Saudek 
collected specimens of seventy-three individuals from nineteen 
business firms In fourteen cases he diagnosed dishonesty, and 
in fifty-nine cases honesty The firms in question confirmed the 
accuracy of his diagnosis in all these fourteen cases, for in each 
of these cases the writer had either been convicted in the courts 
of embezzlement, or had confessed his guilt but had not been 
prosecuted. This dramatic testimony to the power of graphology 
may be convincing to the business man, but the logic of the 
argument hardly squares with other facts that are known con- 
cerning the specificity of conduct. 

Binet (5), the famous French psychologist, was the first to 
study graphology scientifically He set out to study the possi- 
bility of determining sex from handwriting To each of his sub- 
jects he presented envelopes, 180 in all, addressed by an equal 
number of men and women Those who had no special training 
were able to make from 66 to 73 per cent (average 69 per 
cent) correct guesses, where a senes of guesses which are due 
to chance alone would yield about 50 per cent correct One graph- 
ologist made a score of 75 per cent, and the famous graphologist 
Crepieux-Jamin (13) made a score of 79 per cent. Downey con- 
ducted a similar experiment. Using untrained subjects, percentages 
of correct judgment running from 60 to 77 5, with an average 
at 67 3, were obtained. 

In another experiment Binet secured the handwriting of eleven 
notorious assassins and paired each with a sample of hand- 
writing by a law-abiding man. Again the famous graphologist 
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Crepieux-Jamin exceeded the chance expectation in making only 
three errors out of the eleven judgments, or a score of 73 per 
cent. Two other graphologists were able to make only six out 
of eleven correct judgments each. 

Hull and Montgomery (33) studied the claims of graphology 
by comparing exact measures of handwriting characteristics with 
ratings of the character traits they were supposed to represent 
in the same individual. First these investigators made a careful 
survey of the literature of graphology and culled out the asser- 
tions about which there seems to be the most agreement. These 
are given in the following table 

Table 92 

Claims Made of Relationship between Character Traits and 
Handwriting Cilvracteristics 

(from Hull and Montgomery, 33, p 66) ‘ •, 

Character tratt Indicated by handwriting having the foUowmg 

characteristics >’ 

Ambition . . Lines of writing sloping upward 

Pride . Lines of writing sloping upward 

Bashfulness . . . Writing traced with heavy lines 

Force . ... (a) Heavy lines, (b) heavy bars on the t’s 

Perseverance . Long bars on the t’s 

Reserve . . . Closed a’s and o’s 

Seventeen students, members of a fraternity, submitted samples 
of their handwriting These samples were written in a uniform 
but natural way, each copying a chosen passage Exact measures 
were made of the slope of the lines, tlie thickness of the lines, 
the thickness of the bars on the t’s and also their length, and 
count was made of the a’s and o’s which were closed. At the 
same time the subjects ranked each other on the character traits 
in question Correlations computed between the character traits 
and the handwriting characteristics alleged to be symptomatic 
of them are given m Table 93 

The average of these corielations is — 016, which represents 
about the amount of assurance that one should give to the claims 
of graphologists. Any one of the correlations in the table could 
have occurred as a chance deviation from zero, although certain 
of the coefficients are high enough to warrant further investiga- 
tion It would not be surprising to find that forceful people write 
heavier than timid people, or that bashful people write lightly, 
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Table 93 

Correlations of Ratings of Cn\R\CTER and Hand-writing 


(from Hull and Montgomcr >0 

Ambition with upward sloping lines . . . . — 20 

Pride with upward sloping lines . — 07 

Bashfuincss with fineness of lines . . — 45 

Bashfulness with lateral narrowness of m’s and n’j . + 38 

Force with heavy handwriting . — 17 

Force with heavy bars on t’s . . . • . . — 06 

Perseverance with length of bars on ^’s . . 00 

Perseverance with length of bars on Ps, varying size of writing com- 
pensated for . .. -f- l6 

Reserve with closed a's and o’s . • ... — 02 


but such does not seem to be the case when these characteristics 
of people are actually compared with their handwriting. 

Another of Hull’s students, Miss L E. Brown (9), starting 
out with a strong belief in graphology, performed a similar ex- 
periment with similar results as shown in the following table. 

Table 94. 

CoRRELtTION BETWEEN RATINGS OF CHARACTER TrAITS AND HANDWRITING* 
(from Hull, 32, p 130) 

Character trait Handientmg trait Correlation 


Bashfuincss . . Width of down stroke +.11 

Ambition , .Tendency to upward slope as line crosses 

page . +23 

Persistence Width of down strokes — OS 

Persistence Disconnected writing — per cent of breaks 

of line within words — .03 

Personal neatness Neatness in appearance of writing . -f- 23 

Personal individuality Individuality in appearance of writing . +.15 


Hull suggests the possibility that the relation between neatness 
and individuality and the corresponding characteristics in hand- 
writing may represent genuine tendencies 


Summary 

Out of this mass of assertions and claims for systems of char- 
acter reading the positive findings of exact measurement are 
pitifully few. One can say that they leduce themselves to (a) tall, 
large men tend to be more aggressive and sociable and to make 

•From Hull’s Aptitude Testing Copyright 1928 by World Book Company, 
Publishers, Yonkers-on-Hu 4 'on, New YorL 
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[Tiore successful leaders than short men, (b) there are tempera- 
mental differences between fat men with large trunks and small 
limbs and thin men with small trunks and larger, more athletic 
limbs; the former tend to be more open, friendly, jolly, and 
sociable, the latter more reserved, letiring and unsociable, (c) 
personal beauty may (naturally) be determined from photo- 
graphs, (d) sex may be determined from handwriting. It is, of 
course, understood that in none of these is the relationship more 
than suggestive, nor can the diagnosis from these characteristics 
be made with much more than chance accuracy. There are also 
vague indications supporting the following small yet possibly real 
relationships (a) certain ratios of head measurements may have 
a very slight relationship to intelligence or personality qualities; 
(b) neatness, individuality, and similar traits may have some 
relationship to the expression of these traits in handwriting 

Beyond these meager findings it is safe to conclude that one 
cannot use exteinal characteristics or signs on the body to diag- 
nose conduct Conduct is a dynamic relationship between situa- 
tion and response, so that what conduct may be expected from 
a person is a secret hidden in the nervous system, only to be 
discovered by the actual reactions a person makes. 

Blackford and Newcomb claim in one of their books (7, pp. 
V, vi) : 

“Within the six years since the first publication of this 
book we have seen the plan adopted in its essentials by prac- 
tically all piogressive employers of large numbers of men 
and women in this country and also by many in other lands. 
We have seen hundreds of investigators, engineers, students, 
personnel experts and university reseaich workers studying the 
plan, experimenting with it, collecting data in regard to it 
and elaborating upon it We have seen the United States Gov- 
ernment adopt It for all industries working on government con- 
tracts, and conducting free courses of instruction in a number 
of prominent universities teaching hundreds of men and women 
how to operate the plan in industry.” 

That such a claim must be exaggerated is obvious However, 
Kornhauser (41) m 1922 made a check on the practice in in- 
dustry by sending out a questionnaire to too employment man- 
agers in industrial plants and to 100 insurance agency managers 
asking what was their practice in using a system of character 
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analysis, if any. Forty-three and twenty-two replies, respec- 
tively, were received Four industrial plants and two insurance 
agencies, a total of six, used such a system Three used Black- 
ford’s system, two used a combination of Blackford’s system 
with some other, and one did not specify the system. While it 
IS evident that Blackford’s claims are exaggerated, we can feel 
some concern that even six out of 200 should be deceived by 
claims which do not stand up in the light of experimental analysis 
and which consequently have no scientific standing. 

Although the evidence is overwhelmingly against the possibility 
of diagnostic significance in these external signs, there is need 
for further experimental study Most of the experiments described 
in this chapter have employed a small number of subjects, with 
the result that some of the correlations reported seem suggestive, 
although the possibility that they are chance deviations from a 
zeio relationship is not removed. These experiments should be 
repeated using larger numbers of subjects Again it is likely that 
some of these correlations have been produced by the fact that 
in making judgments of character the judges have used as a basis 
for their ratings precisely the physical characteristics whose sig- 
nificance they were testing. The criteria for crucial experiments 
ought to be objective enough to provide a real check. 

Let us close this section by again pointing out the brazenness 
and shamelessness with which the charlatans, and there are many 
of them, push forward their claims. They usually have something 
to sell, and they make their living by selling it. Their assertions 
are dressed up in the most atti active way. They play upon the 
common foibles and weaknesses of mankind, who live in hopes and 
who are seeking praise and encouragement. Many of the successes 
of these impostors in making correct diagnoses are due not to an 
application of their own systems but to a keen evaluation of the 
characteristics of those whom they face. 
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Chapter XV 

MEASURES OF THE ENVIRONMENT 


U P to this point the measures which have already been de- 
sciibed in this book have to do with responses of one 
sort or another. Now it remains to point out that in in- 
vestigations concerning the causes and development of conduct 
it IS often desirable to have knowledge concerning the situations 
in which conduct takes place The situations are as manifold as 
the responses, and it is as impossible to describe succinctly all 
of the permutations and combinations of the environment as it 
IS to describe all of the vagaries of conduct A first step in the 
problem of describing environmental differences is to consider the 
general richness or adequacy or level of the child’s environment. 
The very statement of the pioblem makes it one of measurement, 
for differences in richness or adequacy of the environment pre- 
suppose measuiement To desciibe all existing gradations there 
must be a scale running from the very poor, meager and insuffi- 
cient environment to the rich, full, and adequate environ- 
ment. 

It will quickly be seen that it is not possible to measure at once 
all phases of the environment, le, community, church, play- 
giound, school, and home Most attention in this problem has 
been devoted to the home, for it is there that conduct has its 
source, and beyond question the social and economic level of the 
home IS a prime factor in determining conduct Such items as 
father’s occupation, family income, home furnishings, and pos- 
sessions have been suggested as possible measures of this vague 
thing which we call adequacy of the environment As is usual in 
the problem of measuring human affairs, a search has been 
made for that item or those items which correlate most highly 
with the more general and abstract thing which we want to 
measure In the following sections will be described attempts to 
measure (a) occupation level, (b) home background, and (c) 
certain combinations of the two, and (d) attempts to get at the 
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cultural background through the cultural responses of the 
children. 

Measures of Occupational Level 

The classification of occupations has been a troublesome and 
vexing matter. Decennially the federal census wrestles with this 
problem and at each succeeding census manages by referring to 
past experience to make the occupation classification more and 
more adequate. The census emphasizes the fact that it attempts 
to classify along occupational rather than industrial lines, and in 
the schedule for enumerators an entry for the name of the occu- 
pation or type of work comes before the entry for the industry. 
The census makes the following broad classification of occupa- 
tions under nine heads: 

Agriculture, forestry, and animal husbandry 

Extraction of minerals 

Manufacturing and mechanical industries 

Transportation 

Trade 

Public service (not elsewhere classified) 

Professional service 
Domestic and personal service 
Clerical occupations 

The Bureau of the Census publishes an Index of Occupations — 
Alphabetical and Classified. The index which was used in the 
fourteenth (1920) census contains 572 main occupations and 
occupational groups 

It must be evident at once that the census classification does 
not satisfy the needs of those who are seeking a scale by which 
to measure the level of occupation. For instance, undei the 
heading “Manufacturing and mechanical industries” are to be 
found such diverse groups as “Manufacturers and officials” and 
“Laborers.” The census classification, capable of indefinite sub- 
division, IS useful for some purposes, but does not satisfy the 
requirements of a scale for measurement of social or economic 
level. 

A rough scale was projected by Taussig in his Principles of 
Economics. He desciibes five non-competing groups — “non-com- 
peting m the sense that those born or placed in a given grade 
pr group usually remain there, and do not compete with those 
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in other groups ” We will use Taussig’s phraseology in describing 
his five groups (24, Vol. II, pp. 134-137) . 


“In the lowest group belong the day laborers, so-called the 
diggers and delvers who have nothing to offer but their bodily 
strength. ... In the next group belong those who, while not 
needing specialized skill, yet bear some responsibility, and must 
have some alertness of mind. Such, for example, are motormen 
on street railways. ... In the thud group belong the aristocracy 
of the manual laboring class the skilled workmen. Such are 
carpenters, bricklayers, plumbers, machinists; the whole range 
of occupations where there is need for a sure eye, familiarity with 
tools, a deft and trained hand. . . . Next comes the group that 
approaches the well-to-do, the lower middle class, which avoids 
rough and dirty work, and aims at some sort of clerical or semi- 
intellectual occupation. Here are clerks, bookkeepers, salesmen, 
small tradesmen, railway conductors, firemen, superintendents, 
teachers of the lower grades. . . . Finally, we reach the class of 
the well-to-do, those who regard themselves as the highest class, 
and certainly are the most favoied class tiere are the professions, 
so-called, — the lawyers, physicians, clergymen, teachers of the 
higher grades, salaried officials, public and private, in positions 
of responsibility and power, not least, the class of business men 
and managers of industry, who form in democratic communi- 
ties the backbone of the whole gioup.” 


This rough scale serves as a very satisfactory beginning to later 
more detailed scales. 

Counts (12), in his study of The Selective Character of Ameri- 
can Secondary Education, states that the Tatissig scale is difficult 
to use. He found that the lines between the groups were not 
clearly defined in actual industry Furthermore, because of the 
meagerness of the information which he obtained from high school 
students by asking them the questions, “Father’s occupation^ 
Where or for whom does he work? Is he either owner or pait 
owner of the business in which he works He pronounced the 
Taussig scale “unworkable without resorting to many arbitrary 
decisions ” The classification adopted by Counts uses the census 
classification as a basis, aiming at the same time to get classes 
“of reasonable homogeneity from the standpoint of social status, 
position in the economic order, and intellectual outlook ” It recog- 
nizes the following groups. 

•From Taussig, F W, Principles of Economics, By permission of The Mac- 
millan Company, publishers. 
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I. Proprietors 
II Professional service 

III. Managerial service 

IV. Commeicial service 

V. Clerical seivice 

VI. Agricultural service 

VII. Artisan-proprietors 

VIII. Building and related trades 

IX. Machine and i elated trades 

X. Printing trades 

XI. Miscellaneous trades in manufacturing and mechanical 
industries 

XII. Transportation service 

XIII Public service 

XIV Personal service 

XV. Miners, lumber-workers, and fishermen 

XVI. Common labor 

XVII. Occupation unknown 

Sims in his “Score Card for Socio-Economic Status” uses a 
fivefold classification of occupations, which differs slightly from 
the Taussig classification (21, p. 22) 

Group I. Professional men, proprietors of large businesses, and 
higher executives. 

Group II. Commercial service, clerical service, large land-own- 
ers, managerial service of a lower oidcr than in Group I, and 
business proprietors employing from five to ten men 
Group III Artisan proprietors, petty officials, printing trades 
employees, skilled laboreis with some managerial responsibility, 
shop-owners and business propiietors employing one to five men 
Group IV. Skilled laborers (with exception of printers) who 
work for some one else, building tiades, transportation trades, 
manufacturing trades involving skilled labor, personal service. 
Small shop-owners doing their own woik 
Group V Unskilled laborers, common laborers, helpers, 
“hands,” peddlers, vaiied employment, venders, unemployed (un- 
less It represents the leisure class or retiied). 

The attempt to scale occupations according to economic-social 
level received aid from the results of intelligence testing carried 
on in the United States Aimy during the Woild War. It was 
found, after the scores on Airay Alpha wcie tabulated by occu- 
pations, that the intelligence level of an occupation was a very 
good index of its social and economic impoitance. The following 
table gives occupational intelligence s^ndards as derived from 
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testing with Army Alpha during the war and corrected by Fryer 
in later research carried on m the Central Branch of the Brook- 
lyn Y. M. C. A. 

Table 9S 

OccupATioN,\L Intelligence Standards Based on Army Alpha Intelligence 

Tests 


Score 

aver’ 

age 

Score 

range 

Occupation 

l6l 

110-183 

Engineer 

IS2 

124-1S5 

Clergyman 

137 

103-135 

Accountant 

127 

107-164 

Physician 

122 

97-148 

Teacher 

1 19 

94-139 

Chemist 

114 

84-139 

Draftsman 

III 

99-163 

Y M C A secre- 
tary 

no 

80-128 

Dentist 

109 

81-137 

Executive (minor) 

103 

73-124 

Stenographer and 
typist 

lOI 

77-127 

Bookkeeper 

99 

78-126 

Nurse 

96 

74-121 

Clerk (office') 

91 

69-113 

Clerk (railroad) 

86 

39-107 

Photographer 

85 

57-110 

telegrapher and 

radio operator 

83 

64-106 

Conductor (rail- 

road) 

82 

37-108 

Musician (band) 

81 

59-106 

Artist (sign lettcrer) 

81 

60-106 

Clerk (postal) 

81 

57-109 

Electrician 

80 

62-114 

Foreman (construc- 
tion) 

80 

56-105 

Clerk (stock) 

78 

54-102 

Clerk (receiving and 
shipping) 

78 

61-106 

Druggist 

77 

59-107 

Foreman (factory) 

7 S 

56-105 

Graphotype operator 

74 

S 3 - 9 I 

Engineer (locomo- 
tive) 

72 

54-99 

Farrier 

70 

46-95 

'telephone operator 

70 

44-94 

Stock checker 

69 

49-93 

Carpenter (ship) 


Score 

aver- 

age 

Score 

range 

Occupation 

69 

48-94 

Handyman (gen’l 

mechanic) 

69 

46-90 

Policeman and De- 
tective 

68 

SI-97 

Auto assembler 

68 

47-89 

Engineer (marine) 

68 

42-86 

Riveter (hand) 

67 

50-92 

Toolmaker 

66 

45-92 

Auto engine me- 
chanic 

66 

45-91 

Laundiyman 

66 

49-86 

Gunsmith 

66 

44-88 

Plumber 

66 

44-88 

Pipe-fitter 

65 

44-91 

Lathe hand (produc- 
tion) 

65 

43-91 

Auto mechanic (gen- 
eral) 

65 

43-91 

Auto chauffeur 

65 

42-89 

Tailor 

6s 

44-88 

Carpenter (bridge) 

64 

43-83 

Lineman 

63 

40-89 

Machinist (general) 

63 

46-88 

Motorcyclist 

63 

41-86 

Brakcman (railroad) 

62 

31-94 

Actor (vaudeville) 

61 

40-85 

Butcher 

61 

44-84 

Fireman (locomo- 

tive) 

61 

39-82 

Blacksmith (general) 

60 

38-94 

Shop mechanic (rail- 
road) 

60 

36-93 

Printer 

60 

40-84 

Carpenter (general) 

59 

40-87 

Baker 

59 

39-83 

Mine drill runner 

59 

38-81 

Painter 

58 

37-85 

Concrete worker 

58 

40-83 

Farmer 

58 

37-83 

Auto truck chauffeur 
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Score 

aver 

age 

Score 

laiige 

Occupation 

Score 

aver- 

age 

Score 

range 

Occupation 

58 


Bricklayer 

48 

21-89 

Station agent (gen- 

S7 

BSI 

Caterer 



cral) 

S7 

39-71 

Horse trainer 

40 

19-67 

Hospital attendant 

S6 

38-76 

Cobbler 

40 

19-60 

Mason 

55 

35-81 

Engineer (station- 

35 

18-62 

Lumberman 



ary) 

35 

19-57 

Shoemaker 

55 

34-78 

Barber 

32 

16-59 

Sailor 

55 

35-77 

Hostler 

31 

20-62 

Structural steel 

52 

38-96 

Sales clerk 



worker 

52 

33-74 

Horse-sboer 

31 

19-60 

Canvas maker 

51 

31-79 

Storekeeper (fac- 

30 

16-41 

Leather worker 



tory) 

27 

19-63 

Fireman (stationary) 

51 

26-77 

Aeroplane worker 

27 

17-57 

Cook 

51 

31-74 

Boiler-maker 

26 

18-60 

Textile worker 

50 

33-75 

Rigger 

22 

16-46 

Sheet metal worker 

50 

30-72 

Teamster 

21 

13-47 

Laborer (construe- 

49 

40-71 

Miner (general) 



tion) 




20 

i5-5« 

Fisherman 


Concerning this list Fryer (15) says, 

“The mean for the occupation is presented as the ‘score aver- 
age,’ and the ‘score range,’ indicating the range of intelligence 
within 'which can be expected success in the occupation, secures 
Its limits (usually, but not always) from the first and third 
quartiles The scores are so piesented as to indicate that in all 
piobability an individual must have an intelligence rating within 
the ‘score range’ for achievement in the occupation, with the 
further probability that he should be above the ‘score average’ 
to be sure of sufficient intellectual capacity for the occupation ” 

Examination of the table, however, leads to the conviction 
that even such a detailed analysis does not present sufficient 
evidence to make it possible to scale occupations at all ac- 
curately. One notes, for instance, that the range of the middle 
SO per cent of band musicians is from a score of 57 to a score 
of 108 on Army Alpha. Twenty-five per cent have scores lower 
than 57, and twenty-five per cent have scores higher than 108 
In other words “band musicians” probably covers nearly the 
whole range of scores on Army Alpha In order to place a 
man accurately on the scale, one must know more than his occu- 
pation title, and even with an accurate knowledge of his occu- 
pation, judgment must be exercised in placing an individual on 
the scale. 
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Barr tried to overcome this difficulty by building up a true 
scale of occupations, based piimaiily on the intelligence stand- 
ards of occupations He prcpaicd a “list of one hundred repre- 
sentative occupations, each definitely and concretely described, 
and had thirty judges rate them on a scale of zero to one hun- 
dred according to the grade of intelligence which each was be- 
lieved to demand.” These ratings were then treated statistically 
and P. E. values assigned to each occupation. 


P E 
value 


BvRa Scale of Occiwational Status 
Occupation Description 


0 00 Hobo 

1 54 Odd jobs 

2 11 Garbage collector 

3 38 Circus roustabout 
3 44 Hostler 

3 57 Railroad section hand . 
3 62 Day laborer 

3 99 Track laver 
420 Waterworks man 

4 29 Miner 

481 Longshoreman 

4 91 Farm laborer 
498 Laundry worker 

5 27 Bartender . . 

5 41 Teamster 

S 44 Sawmill worker 
5 59 Dairy hand 
5 81 Drayman 

5 87 Deliveryman 

6 14 Junkman 
642 Switchman 

6 66 Smelter worker 
6 27 Tire repairer 
6 85 Cobbler and shoemaker 
6 86 Munition worker 
692 Barber 

6 93 Moving picture operator 
702 Vulcanizer . 


Does heavy, rough work about circus 
Care of horses in li\ery, feed, and sales 
stables 

Replaces ties, etc , under supervision 
On street, in shop or factory as roust- 
about 

Docs heavy work under supervision 
A variety of odd jobs, all unskilled 
Digger and shovelcr, etc 
Loads and unloads cargoes 
Unskilled and usually inelTicient 
Various kinds of work m laundry (prac- 
tically unskilled) 


Heavy work, little skill required 
Milking, care of stock under supervision 

Delivers groceries, etc , with team or 
auto 

Collector of junk 

Tending switch m railroad yards 

Metal pouters, casting collectors, etc 

In general automobile repair shop 

Repairman in shoe-shop 

Average 

Not owner Has charge of chair 
Operates machine which projects pic- 
tures 

Understands the process of hardening 
rubber. 


F rom Terman, L M , Genetic Studies of Genius, Vol I, Mental and 
Physical Traits of a Thousand Gifted Children, 2d ed (Stanford University 
Press, 192s). PP 67-69. 
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P E 

value Occupation. 

7 05 General repair man . . . . . 

706 Ship rigger 

7 17 Telephone operator 
7 19 Cook 

7 23 Street-car conductor 
7 24 Farm tenants 
7 30 Brakeman 
7.33 City fire-fighter 

7 39 Railroad fireman 
7 54 Policeman . 

771 Structural steel worker 
7 73 Telephone and telegraph lineman 
7 77 Bricklayer ... . 

7 79 Butcher 

791 Baker 

8 02 Metal finisher . 

8 04 Plasterer 

8 08 General painter 

8 2^ Harness maker 
840 Tmsmith 

8 49 Letter carrier 
8 50 Forest ranger 
8 58 Stone mason 
8 75 Plumber 

8 89 Gardening, truck farming 

8 99 Electnc repair man 

9 28 Bookbinder 
9 37 Carpenter 


9 37 Potter 

9 54 Tailor 

9 72 Salesman 

ion Telegraph operator 
10 21 Undertaker 

10 26 Station agent 

10 26 Mechanical repair man 

10 29 Dairy owner and manager . . 


Description 

Repairs broken articles Uses wood- 
working tools 

Installing cordage system on sailing ves- 
sels, working under supervision 

In restaurant or small hotel 

On small tracts of land 

On freight or passenger trains 

Handles the ordinary fire-fighting ap- 
paratus 

On freight or passenger tram 

Average patrolman 

Heavy work demanding some skill 

Not shop-owner Able to make cuts 
properly 

Polishes and lacquers metal fixtures, etc 

Knowledge of materials used necessary 

Paints houses, buildings, and various 
structures 

Makes vessels, utensils, etc , from plated 
sheet metal 


Average trained plumber employee. 

Owns and operates small plots 

Repairs electric utensils, devices, and 
machines 

Sets up and binds books of all sorts 

Knows wood-working tools Can follow 
directions in various processes of 
w'ood construction woik 

Makes jars, jugs, crockery, earthenware, 
etc 

Employee m tailoring shop 

• In dry-goods, hardware, grocery stores, 
etc 

In small towm 

In small town Six months to a year of 
especial schooling 

In small town Acts as baggage man, 
freight agent, operator, etc 

In shop or factorj' Keeps machines in 
condition 

.Small dairy, 50-100 cows 
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p E 

value Occupation 

10 S3 Metal pattern-maker 

10 S4- Wood pattern-maker 

10 54 Lithographer . . . . 

1076 Linotype operator 

1083 Photographer .. . 

10 86 Detective . 

1099 Electrotyper . 

11 17 Traveling salesman 

II 34 Clerical work 

1135 Railroad passenger conductor 

II SI Storekeeper and owner 

1 1 74 Foreman 

1 1 78 Stenographer 

12 02 Librarian 

1206 Nurse and masseur 
1274 Chef . 

12 84 Editor . . , 

12 89 Primary teacher . . . . , 

12 96 Landscape gardener 

1308 Grammar grade teacher . .. 

13 20 Osteopath . . ... 

1321 Pharmacist .. . 

1329 Master mechanic . . . .. 

13 30 Music teacher ..... . . 

1331 lytanufacturer . . . . 

13 S4 Dentist . . 

13 58 Art teacher 

13 71 Surveyor 

13 31 Tram dispatcher 

144s Land-owner and operator 

14 70 Musician 

15 05 Secretarial work 

IS 14 High school teacher 

IS 15 Preacher 


Description 


. Makes prints from designs which he 
puts on stone 

City 1,000-5,000 A few months’ train- 
ing, experience m studio 

..Traces clues, etc. Employee of detective 
bureau. 

. Sells drugs, groceries, hardware, dry- 
goods, etc. 

..Bookkeepers, recorders, abstractors, etc 

Small town retail dealer, general or spe- 
cial store. 

Small factory, shop, etc 

Writes shorthand and uses typewriter 

In small mstitution or public library 

. . Graduate 

Employed in large first-class hotels 
.Small paper, considerable job work 
No college training, two years’ special 
training 

..Normal graduate, expects to make pro- 
fession teaching. 

..Trammg equal to college graduate 

..In town of from 1,000-5,000 population 

..Thorough knowledge in his field of me- 
chanics 

.2-4 years’ special training, not college 
graduate 

Employs from 10-50 men Makes simple 
articles. 

Graduate Two to five years’ experience 
m small town. 

. In high school Three or four years’ spe- 
cial training 

. Transit man City or county surveyor 
Must be mentally alert 
Very large farms or ranches 
Successful player or singer m good com- 
pany. 

Private secretary to high state or na- 
tional officials. 

. College or normal graduate Not the 
most progressive 

...Minister in town of 1,000-5,000 College 
graduate. 
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P E 

valuf Occupation 
15 42 Industrial chemist 

15 43 Mechanical engineer 

15 71 Teacher in college . . 

IS 7S Lawyer . 

15 86 Technical engineer . , . 

16 18 Artist 

16 26 Mining engineer 

16 28 Architect 

16 58 Great wholesale merchant 
16 S9 Consulting engineer 
1664 Educational administrator 

16 71 Physician . , . , 

16 91 Journalist .... 

17 50 Publisher .... . . . . 

1781 University professor .... 
1806 Great merchant 

t8 14 Musician . . . 

18 33 High national official . . 

18 8s Writer 

1945 Research leader 

19 73 Surgeon 

2071 Inventive genius 


Description 

Thorough knowledge of the chemistry of 
manufacturing processes 
. Designs and constructs machines and 
machine tools 

Degree A B or A M. Not the most pro- 
gressive 

In town of moderate size Income 

$ 1 , 000 -^ 3.000 

. . .Thorough knowledge of the processes of 
an industry 

.High-class painter of portraits, etc 

...Thorough knowledge of mining and ex- 
traction of metals 

...Training equal to college graduate 
.Business covering one or more States. 

...In charge of corps of engineers 

...Superintendent in city of 2,000-5,000 
College or normal graduate 

...Six to eight years’ preparation above 
high school Income $5,000 and up 

...High-class writer or editor 

...High-class magazine and newspaper or 
periodical, etc 

...Has AM or PhD, writes, teaches,'and 
does research 

...Owns and operates a million-dollar busi- 
ness 

, . . . (Paderewski) 

, . . . Cabinet officers, foreign ministers, etc. 

. . (Van Dyke) 

Like Binet or Pasteur, 

. . (Mayo Brothers) 

(Edison type) 


The Barr scale, used by Terman in his study of gifted chil- 
dren, apparently provides the truest means for scaling occupa- 
tions that we now have. The scale by definition places each 
occupation in order of the intelligence it demands. Although the 
matter has not been investigated, it is presumed that such a 
scale will correlate very highly with a scale made strictly to fit 
socio-economic level. 

In the Minnesota Mechanical Ability Project a scale similar 
in nature to the Barr scale was drawn up to be used in grading 
occupations according to the amount of mechanical ability in- 
volved. 
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Measures of the Home 

There are two fundamental methods of obtaining data neces- 
sary to measure the social-economic level of the home One is to 
have a visitor enter the home of a given child for the purpose 
of noting on a prepared schedule the equipment and living con- 
ditions which he finds The other is to ask the child questions 
designed to 3’'ield similar information. The former method, al- 
though more expensive in time and energy, yields the more re- 
liable results The latter method can be used in laige groups in 
school and the mfoimation can be more quickly and easily ob- 
tained, but It IS liable to inaccuracy. 

As frequently occurs, one of the earliest pieces of work iji 
the technique of measuring the home was one of the best, though 
It has since fallen into oblivescence. Perry (20) in 1912, wishing 
to study the standard of living, sought a measure of the ade- 
quacy of the home A preliminary study convinced him that the 
four rooms of a house in descending order of importance were 
the kitchen, bedioom, dining-room, and parlor His plan pro- 
posed to inventory the furnishings of these four rooms with a 
view toward determining how adequately they served the needs 
of the household He provided a check list for the furnishings of 
each of these foui rooms, adding a system of weights to aid in 
assigning ratings for each article checked The sum of the ratings 
may be used as a measuie of the level of the home The inves- 
tigator merely passes through the house, checking off on the 
schedule the articles in evidence. The ratings may be computed 
at leisure. The score is the sum of the weightings assigned, di- 
vided by the sum of the required weights, which represents a 
sort of minimum standard for an adequate household. 

A similar method has been employed by Chapin (7) in meas- 
uring the equipment of the living-room of an urban middle-class 
family. His scale consisting of fifty-three items is divided into 
four sections: 

I. Fixed features 

II. Built-in features 

III. Standard furniture 

IV. Furnishings and cultural resources 

Each item either contains directions for assigning credit or in- 
cludes a brief scale to enable the visitor to give it its proper weight 



Measures of the Environment 547 

Examples of items are: 

2. Floor covering. 

Composition i, carpet 2, small rugs 3, large rugs 4. 

21 Chair 

Straight, rocker, arm-chair, high-chair, i each. 

49 Radio 

Crystal i, one-tube 2, two-tube 3, three-tube 4, super- 
heterodyne s, etc. 

Chapin chose the living-room because it could be scored in a 
comparatively short interview which would not involve objec- 
tionable inquisitorial questions He found that the score of the 
living-room had a high correlation with the general socio-economic 
status of the family. Indeed, this measure of the living-room has 
as high a correlation with an average of several measures of 
socio-economic level as any one of the scales specially designed 
to measure geneial socio-economic level has with the average. 

Commons (ii), still earlier, developed a score card for meas- 
uring the home. This score card, in two sections, is so arranged 
that the maximum credit possible for each section is 100 points. 
Considerable judgment must be exercised m using this type of 
score card for items in which less than the maximum credit is 
granted. Until one has had much experience with the scale and 
has built for himself a set of personal standards, the assignment 
of credit is extremely subjective Commons tries to guide the 
assignment of credit by the following key: 

Instructions for Discrediting When Depending on Judgment 

Deduct from possible 6, very slight i; slight 2, marked 3, very 
marked 4, extreme 5 

Deduct from possible 3, very slight slight i, marked ij 4 ; 
very marked 2, extreme 2j4. 

A brief outline of the score card follows: 

Dwelling House Score Card 


I. Dwelling 

Location 18 

Congestion of buildings 26 

Window openings ii 

Air and ventilation 13 

Structural conditions 6 

House appurtenances 26 


100 
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II. Occupants 

Congestion of occupancy 61 

Condition of air and ventilation 18 

Cleanliness 21 


100 

A third method of measuring the home is represented by the 
“Whittier Scale for Grading Home Conditions” proposed by 
Williams (29). This is a series of five five-point rating scales for 
measuring the factors of necessities, neatness, size, parental con- 
ditions, and parental supervision. 

This scale has the problem of conduct directly in mind, since 
it contains not only Items relating to the adequacy and furnish- 
ing of the house, but two items pertaining to the social adequacy 
of the home and the parental control. We note m reading the 
Items on the scale that they apply to a rather specific situation. 
Workers who wish to use this type of scale should redefine the 
Items to fit the locality in which they intend to work. 

Measures of Socio-economic Level 

Early work by Van Denberg (26), Holley (17), Counts (12), 
and Kornhauser (18) paved the way for the later work of Chap- 
man and Sims (8). Chapman, of Yale, was attracted to the 
possibility and value of studying intensively the significance of 
various factors In the environment of a child which could best 
serve as indices of this more general thing which we call socio- 
economic level. Where previous workers had used such factors 
as parental occupation, telephone in the home, etc.. Chapman 
conceived the idea of trying out a large number of items, cor- 
relating each of the items with the most valid criterion of socio- 
economic level obtainable, and selecting the most significant items 
to form a scale for the measurement of socio-economic level. 

Chapman, In collaboration with Sims, one of his students, un- 
dertook to study the factors which influence participation in 
extra-curricular activities, using a questionnaire of sixteen ques- 
tions relating to socio-economic status. This questionnaire was 
given to the students (over 3,000) enrolled in a New Haven high 
school. Correlations were found between each item (using bi- 
serial r, since questions, were answered or «o) and a total 
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score found by using all the items and weighting each equally 
From these data a crude scale was developed. 

After the death of Chapman, Sims (21) continued the work 
on a larger scale, his woik being financed by the Character Educa- 
tion Inquiry Sims used the following criteria in assembling ques- 
tions for his original schedule. 

1. Each question must be indicative of the economic or the 
cultural level of the home 01 both. 

2. The questions must cover as many aspects of the home 
background as possible 

3. The questions must be stated so that the child can under- 
stand them 

4 The questions must ask for information which the child is 
willing to furnish 

5 The questions must ask for information which the child can 
furnish. 

6 The questions must be stated m such a manner that there 
will be a minimum chance of error. 

7. The questions must allow of answers that are comparable 

FIfty-si’c questions were used in the original try-out. The items 
which gave the highest bi-serial correlations with the composite 
are as follows; 

Tablc 96 

Bi-SEKIAL CoRREIJlTION CoEFFIClEHTS BETWEEN AsPECTS OF HoME BACKGROUND 
AND Socio-Economic Status of the Home 

(Suns, p 12) 


I 

Servants 

... 86s 

9 Books . 

• 773 

2 

Golf 

860 

10 Father h s 

764 

3 

Father’s occupation 

856 

II Lectures . . . 

758 

4 

Dancing lessons 

. 8n 

12 Mother goes to lectures 

743 

s 

Dental w ork 

793 

13 Rooms people 

739 

6 

Music concerts 

788 

14 Mother college 

735 

7 

Bank account 

786 

IS Furnace 

731 

8 

Mother h s 

775 

16 Vacation . . 

. 70s 


17 Telephone 702 


However, some of these items were answered yes by such a 
small percentage of a normal group as to be less useful than other 
items. Sims used the following criteria in selecting items for 
his scale (21, p 18) • 

1. The ability and willingness of the persons tested to furnish 
the information. 

2. The correlation of a given item with the total of the other 
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questions. Other things being equal, the higher the correlation, 
the more desirable the question 

3 The inteicorrelation between the items Othei things being 
equal, the lower the intercorrelation the more desiiable the ques- 
tion. 

4 The percentage of the population possessmg the article or 
engaged in the actwity asked about in the question Other things 
being equal, it is desirable to have the questionnaire include as 
wide a representation of items as possible, that is, incJude some 
Items possessed by many, others by few, otherwise there would 
be a tendency for a large undistributed group to form at one or 
the other extreme. 

5. The reliability of the question. 

6 The variety of aspects of home background recommended 
Other things being equal, it is desirable to measure as many 
aspects of the complex as is possible Where thcie are two or 
three questions pertaining to the same aspect, for example, of 
two questions asking for information as to lectures, it is desi ab'e 
to retain only the better one. 

7 Common sense 

Twenty-seven items were finally selected for the question- 
naire. The scoring method is rather elaborate, being based es- 
sentially on a method devised by Chapman. 

1. The percentage of those answering a question yes and those 
answering no was obtained. 

Do you have a telephone m the housed Yes No 

38570 6 i-S 7 o 

2. Using the Kelley-Wood table of the probability integral, 
these percentages are transferred into sigma values: 

-1-993 - 621 

5. These sigma values are multiplied by certain weights that 
have been assigned to each question on the basis of correlation 
with the criterion mtercorrelations, reliability, etc The weighting 
for the question about the telephones is 5. Multiplying the 
sigma values by 5, one has: 

+4965 - 3 -ioS 

4. Add 10 to each item to make all values positive: 

-[-14.965 +6895 

5. Give approximate integral value: 

+ 15 7 _ 
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These numbers (is and 7) are used as scores to be given to 
yes and no answers to this question 

Sims repoits a reliability of + 94 > correlating the responses 
of 200 paired siblings 

The care, thoroughness, and soundness of method used in con- 
structing this scale make it the most valid method available at 
present for determining socio-economic level In his monogiaph, 
Sims reports the findings in different communities which permit 
accurate comparisons between them. 


Testing for Cultural Background 

A novel method of determining the adequacy of the home 
environment, developed m connection with the work of the Char- 
acter Education Inquiry, consists in testing the cultural responses 
of children For this purpose there was developed a “Good-Man- 
ners Test” by Orr (16) and an “Apperception Test” by Bur- 
dick (4). The former is a test of manners and etiquette em- 
ploying true-false, yes-no, and best-answer techniques. The latter, 
employing a variety of techniques, endeavors to tap the richness 
of the child’s home life by questioning him in an indirect way 
about Its economic and cultural resources. Reliability coefficients 
of .621, 708, 741, and 766 were found by correlating two scores 
of the test, and a reliability of .499 by correlating the scores 
of siblings The test correlates with a criterion of home back- 
ground 659. 

These correlations indicate that this approach has promising 
possibilities. An inspection of the tests themselves, however, in- 
dicates that they probably correlate well with intelligence, and 
we are left wondering to what degree their value as a measure 
of home background is reached through the intermediary of 
intelligence There is another still more fundamental objection 
to this type of approach to the measurement of the environment 
in that it confuses the environment with the verbal responses 
which the environment stimulates In a sense, it is begging the 
question to assume that one can be used as a measure of the 
other. Indeed, it is extremely important to know exactly how 
much relationship there is between the environment and a child’s 
responses to it We feel safer, therefore, in studying the environ- 
ment directly rather than through its effects. 
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Among other attempts to measure special phases of home back- 
ground should be mentioned the “Questionnaire on Cultural and 
Mechanical Environment” which was developed and used as a 
part of the “Minnesota Mechanical Ability Study” (19). One 
section of this questionnaire deals with the “tools m the home,” 
— a list of tools to be checked for ownership by father and by 
son, yielding an inventory and measure of the mechanical re- 
sources of the home. This check list possesses high reliability 
(.94 and .89). 
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Chapter XVI 

THE CASE STUDY: A COMPREHENSIVE STUDY 
OF THE INDIVIDUAL 

W ITH the development of social work, and with the in- 
creased attention being given to individuals in education, 
more and more consideration is being devoted to case 
work and the techniques of the case study The word case in this 
connection has several ill-defined meanings The case method, 
long used in law schools, has peicolated through other types 
of professional training and has even penetrated to college and 
secondary school instruction It is a method of instruction em- 
ploying the study of cases as the source of problem material 
and as a means of making concrete the application of general 
principles Case work refers to remedial, therapeutic, or corrective 
work carried on by physicians or social workers in efforts to 
bring about better adjustments among individuals. This latter 
use of the word case is closely linked up with the case study, 
by which is meant a comprehensive, exhaustive investigation of 
an individual The case study and case work usually are closely 
intermingled. The case worker may give advice and propose 
remedies even while she is still pioceeding with the collection 
of pertinent facts In the present discussion, however, the case 
study alone is considered. 

It should be emphasized at the outset that the case study is 
not a research method. Primarily its function is to study the 
individual with a view toward helping him If the case study 
yields evidence that is helpful in scientific investigation, this is 
only a by-product and not its main contribution. If the case study 
employs a schedule of facts to be noted, and if these facts have 
been obtained in a reliable, objective manner, then these data 
may be used in research investigations. But the case study method 
contains no guarantee that its observations are complete or uni- 
form, or that scientifically valid methods were employed in get- 
ting them. The case study has the individual’s interest uppermost 
' 555 



556 Diagnosing Personality and Conduct 

in mind and may or may not employ a regular inquiry schedule 
or use consistent methods. 

The case study is just as good as and no better than the 
methods employed in gathering the data. To most persons the 
case study means the interview If the interview method of ob- 
taining data for a case study is used, then the case study pos- 
sesses the validity and reliability of the interview method How- 
ever, the case study does not need to employ the interview 
method solely, and when objective tests or questionnaires are 
available they should be used in preference to the more sub- 
jective interview. Again the case study usually means the study 
of one individual at a time, but if group methods yield as re- 
liable evidence, there is no reason why group tests or question- 
naires should not be used as part of the data for case studies 
The case study ought to employ the most objective, the most re- 
liable, the most economical, in short, the best methods of col- 
lecting data available When this is done, the data obtained in 
making a case study are welcomed as data to be used in scientific 
research. 

To be more concrete, medical science is now recommending 
the periodic health examination, which is a case study of the 
health and physical condition of an individual As part of this 
examination the subject may be asked questions about his feelings 
or his living habits But the mam part of the examination consists 
of an objective personal examination of the individual’s physical 
condition, using the most refined techniques and measuring de- 
vices that medical science has produced The examinations are 
thorough and the conclusions and recommendations are based on 
a survey of all the findings That is a case study As the records 
accumulate they become invaluable data for scientific research 
into the incidence and etiology of disease In the diagnosis of 
personality and conduct similar standards of objectivity and re- 
liability should be adopted. 

In planning a case study, careful consideration must be given 
to the items to be included m the schedule If the aim is a com- 
plete psychological case study, the practice is usually to include 
Inquiry into every factor that might have significance m under- 
standing the case. In such a comprehensive schedule questions 
will be included relating to heredity, physical constitution, de- 
velopmental history, intelligence and other special abilities, home 
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and other environmental factors, favorite activities, companions, 
personal adjustment_s, behavior characteristics, emotional sta- 
bility, reputation as repoited by paients, teachers, and fi lends, 
ideals, ambitions, wishes, and possibly an intensive study of emo- 
tional complexes Many of these blanches of the inquiry will 
yield nothing of particular significance for an understanding of 
the peculiarities of the case, but all must be investigated in the 
interests of thoroughness. Quite possibly the clue to the problem 
will be found to be in any one of the categories listed above, 
and probably the total syndrome will reveal abnormal conditions 
in several A problem child may leveal bad inheritance, constitu- 
tional weaknesses, poor home conditions, all in harmony with 
evidence as to behavior peculiarities and his own evidence as to 
maladjustment In the case study we can afford to neglect nothing 

Usually, however, since one’s purpose is the understanding of 
some particular set of symptoms, it is possible to abridge the 
exhaustiveness of the study As scientific inquiry proceeds, evi- 
dence accumulates as to the relationship of the various factors 
going to make a total personality picture With such evidence at 
hand it becomes possible to begin an investigation at once with 
the factors having the greatest probability of relationship This, 
of course, is the practitionei’s method The expeiienced diag- 
nostician, or the student m command of the pertinent scientific 
evidence bearing on a condition, does not need to undertake an 
exhaustive and complete case study He naturally uses ceitain 
tests first, and if they show abnoimal conditions, he may pro- 
ceed no further, confident in his assumption that he has the key 
to the situation. 

As an example, consider the diagnosis of reading disability. 
The symptoms — difficulty with reading — are comparatively lim- 
ited and distinct The case study, then, need not be exhaustive, 
but may proceed at once to the particular functions involved. 
Tests of deficiencies in vocabulary, word recognition, phrase and 
sentence comprehension, paragraph comprehension, visual, audi- 
tory, and motor functions, associative learning, or even nervous 
and emotional stability may be employed* Scientific work has 
already demonstrated relationship between deficiency in these 
various functions and difficulties in reading. 

*See Gates, A 1, The Improvement of Reading (The Macmillan Company, 
1928). 
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In considering what abridgment is possible in the case study, 
three cautions must be kept in mind In the first place an inves- 
tigator must draw conclusions on the basis of proved relation- 
ships It IS a natural human tendency to generalize from a few 
cases, particularly the cases with which one has had personal 
experience. Witness how only too frequently practitioners act on 
hunches, and play up fads and their pet theories. It is not too 
much to ask that judgment be based only on relationships that 
have been demonstrated to exist instead of on merely plausible 
speculations Nervousness among children has been variously 
ascribed to heredity, to undernourishment, to fatigue, to unhappy 
home conditions, to a sense of inferiority or guilt, and to mas- 
turbation. The diagnostician who rides a hobby will investigate 
his abnormality with respect to one of these factors, and if he 
finds it, will cease further search. 

This leads us naturally to the second caution, which is that 
a case study should not be abridged when one suspicious factor 
has been found, particularly when the relationship between this 
factor and the disturbing symptom is loose and uncertain Re- 
searches do not often yield categorical evidence that such and 
such a factor is a cause and another one not a cause Rather 
they show that it is a matter of degree, that the relationship 
between factors is low or high, loose or close In practice, error 
creeps in through discovering the presence of adverse conditions 
with respect to a factor that has low relationship to the disturbing 
symptom and resting there, satisfied that the cause has been 
located Where relationships are low, the search should be con- 
tinued for all possible disturbing factors. 

As an example, Hartshorne and May report that negro school- 
children cheat more on school tests than would be expected from 
comparisons with other racial groups This might be falsely gen- 
eralized into, “All negro children are dishonest ” Then a school 
principal, having in mind this imperfection generalization, might 
easily reason concerning a pupil sent to him for cheating after 
this fashion' “This pupil cheated; but he is colored, what can 
you expect^” The absurdity of this judgment is patent, and yet 
psychiatrists are daily making case studies, drawing inferences, 
and making recommendations on evidence that has less founda- 
tion than that used in this illustration. 

The third caution lies in the other direction. The author’s re- 
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view of diagnostic techniques convinces him of the value of small 
correlations In the early stages of educational research, students 
were looking for the big, obvious relationships between phe- 
nomena, and relationships represented by coefficients of correla- 
tion as low as .10 or 20 were considered negligible, unworthy 
of attention We may confidently assert that practically all of the 
marked relationships in human behavior have now been explored. 
As one example, the relation between intelligence and a myriad 
of other factors has been investigated The general conclusion 
is that the relationship is often close, but always with enough 
leeway so that other factors may have considerable influence 
in determining actual behavior or the status of the individual. 

There is a need now for the patient investigation of relation- 
ships of low degree carried out on sufficiently large numbers and 
with accurate enough measures to insure that the relationships 
are reliably determined. These low relationships are now sensed 
to be of particular importance in diagnosing conduct. It is well 
known that extreme values of factors which have low regression 
weights m the regression equation are potent for determining 
an issue Usually, because of the interrelation between factors, 
an extreme score in one of the variables tends to pull other factors 
along with it. As an example, take the relationship between cheat- 
ing in school and low economic status of the family. If a child 
comes from a very impoverished family, that fact is apt to carry 
with it low family cultural standards, and possibly unfavorable 
race, religion, companions, and whatnot, all tending to work in 
the direction of habits of dishonesty Many factors are contrib- 
utory, and no one fictor can properly be isolated as the cause, 
unless poverty, being the most extreme, can be thought of as 
dragging the others along with it. 

In a similar manner it is said that the common cold is caused 
by sitting in a draft, though many people sit in drafts without 
getting colds Along with the exposure must go a state of body 
chemistry, the prevalence of colds in the community, and other 
contnbutory factors. One factor alone stands out as the inciting 
cause because it is more extreme than the others Factors with 
low relationships become important because of their influence 
when extreme. And though research needs now to give more care- 
ful attention to low relationships, this is not to be taken to mean 
that the clinical worker is given freedom to ride his hobby. 
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With the refinement of diagnostic methods the techniques of 
measurement often become so difficult as to require a specialist. 
Often to-day the case study represents scientific synthesis. One 
specialist is responsible for the intelligence testing, another for 
testing the reflexes and senses, another for investigating the habit 
patterns and adequacy of adjustment, another for visiting the 
home, another for making chemical tests, and another for ob- 
taining the physiological measures of emotional disturbance. 
Finally some one is responsible for gathering up the results and 
interpreting their composite significance. To this end the various 
contributors may gather to discuss their respective findings In 
the work of the child guidance clinics it is common practice to 
have a case conference where the physician, psychiatrist, psycholo- 
gist, and social case worker present findings and formulate their 
significance or a program of recommendations. 

Usually the case study is nothing more than a cross-section of 
the individual, presenting a portrait of him at any one time as 
determined by the various measures and other forms of evidence 
gathered Often, however, the study of the case may be con- 
tinued through several weeks, months, or even years, with re- 
peated observations or measurements In such cases the case 
study possesses even greater significance, because to the cross- 
section is added a picture of growth, development, and continuity. 
The accumulating records of the social worker contribute to an 
understanding of the forces which have worked to bring about 
the present situation. 

Diagnosis of Character and Personality 

A diagnosis of character and personality is conditioned by 
what one means by these terms Of all terms used in psychology 
and education they are the most poorly delimited Part of the 
difficulty is due to the different ways in which people conceive 
conduct to be constituted; another part, to the different emphases 
placed on various values such as morals, adjustment, integration, 
conformity, self-expression, and the like. In general character 
refers to the habits and skills with which one faces life’s situa- 
tions, particularly such as are social, and has special reference 
to the organization and consistency of conduct. Personality re- 
fers to a more complete description of the constitutional make-up, 
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including physique, intelligence, temperament, and character. 
More specifically it sometimes refers to the adequacy of peisonal 
adjustments, also especially in social relationships 

Why should any one wish to measuie chaiacter and per- 
sonality, apart from a general curiosity with regard to these 
matters? Those who insist on the need for measuies of character 
are educators interested in obtaining evidences of development 
in citizenship and moral character, and also personnel woikers 
in industry who are seeking some method of selecting employees 
for such qualities as industry, trustworthiness, or punctuality. 

Measures of personality are most often demanded by psychia- 
trists and others inteiested m the adequacy of social adjustment. 
One might think that the diagnosis of crime was particularly a 
matter of moral character, but students of the subject say 
very little about moral character in discussions of crime. The 
criminal is looked on as a moral peivert by the common man, but 
more often the expert sees him as a man who is tU — ill-adjusted 
to the demands of social living It is quite probable that funda- 
mentally these two points of view are not so distinct as they 
seem, and both' parties could profit by a better understanding 
_of the other’s position. 

It must be admitted at the outset that any measure of char- 
acter IS an average, an abstraction There is nothing that cor- 
responds to what we call character in actual conduct Take a 
person in each of 100 situations, and in some he will be honest, in 
others dishonest, in some he will be prompt, in others dilatory; 
in some he will be aggressive, in others retiring. To say that a 
person scores 50 on a character scale helps very little in pre- 
dicting his behavior in a specific situation or in understanding his 
difficulties. As Hartshorne and May have said, what we find are 
honest and dishonest acts and not dishonest and honest persons. 
“When one speaks of an honest person performing a dishonest 
act or a dishonest person performing an honest act all useful dis- 
tinctions between honesty and dishonesty are destroyed.” The 
specificity of conduct is a fundamental fact. 

If this were the last word, we should have to drop the discus- 
sion of the diagnosis of character as unworthy of further atten- 
tion. However, facts discovered by Hartshorne, May, and Shut- 
tleworth indicate that certain values may still inhere in the 
attempt to measure character. By means of an extensive program 
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of testing m which sixty-four testa of behavior were administered 
to 850 school-children, they were able to approach the problem 
of the measurement of character empirically They used three 
criteria of character (a) reputations of the pupils obtained from 
both teachers and pupils by a variety of rating devices, (b) a 
series of too character portraits graded by competent persons on 
the basis of descriptions afforded by testing and ratings, and 
(c) measures of the consistency of behavior in terms of the 
smallness of variability of behavior 
With each of these measures there were correlated other tests 
or indices used in the Character Education Inquiry. Tables 97, 
98, and 99 give the mam facts as presented in Studies in the 
Organization of Character. 


Table 97 


Relative Significance or Performance Factors for Total Reputation • 
(from Hartshorne, May, and Shuttleworth) 


r’s for population Y, by class- 

room I 


r’s for population XYZ^ 


I 

Teachers’ marks 

412 

I 

Teachers’ marks . 

78 

2 

Opinion A 

260 

2 

Opinion B 

S3 

3 

Opinion B 

254 

3 

Opinion A ... 

49 

4 

Good citizenship 

188 

4 

Information 

49 

5 

Information 

174 

5 

Good citizenship 

40 

6 

Culture 

237 

6 

Culture 

53 

7 

Kits + envelopes (service) 

283 

7 

School honesty 

81 

8 

Athletic contest (honesty) 

240 

8 

Total honesty . 

48 

9 

School honesty 

183 

9 

Service 

63 

10 

Inhibition total 

210 

10 

Inhibition total . 

62 

II 

Persistence total 

117 

II 

Persistence total 

SO 

12 

Emotional stability 

383 

12 

Emotional stability . 

38 

13 

Intelligence (IQ) 

244 

13 

Intelligence (MA) . 

35 

14 

Resistance to suggestion 

152 

14 

Resistance to suggestion 

32 

IS 

Self-function 

071 

15 

Self-functioning 

31 

16 

Age 

121 

16 

Age 

-23 


^ Corrected for class-room heterogeneity 2 Corrected for attenuation 


Most significant for the problem of the measurement of char- 
acter was the finding that general integration correlates with the 
average of twenty-three tests 440, which corrected for attenua- 
tion becomes 73 , correlates with total reputation .400, which 
when corrected becomes .79; and correlates with the character 
portraits .344 which when corrected becomes .55. In other words, 

• From Hartshorne, H , May, M A , and Shuttleworth, F K , Studies in the 
Organization of Ch^acter By permission of The Macmillan Company, pub- 
lishers ' - " F F 


/ 
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Table 98 

Relative Significance for Character or Miscellaneous Concomitants* 
(from Hartsfiome, May, and Shuttleworth) 

Factor Test 


Group I 
Reputation 


Group 2 

Moral knowledge 
and opinion 

Group 3 
Conduct 

Group 4 
Culture 

Group S 
Personal factors 


Group 6 

Separate conduct 
tests 


Group 7 
Socio-economic 
status 
Group 8 
Age 


Conduct record 

716 

Check list 

664 

Scholastic marks 

619 

‘ Guess who” 

58s 

Deportment marks 

SH 

Total reputation 

611 

Opinion A -|- B 

602 

Opinion B 

569 

Information 

434 

Opinion A 

424 

Good citizenship 

-399 

Service total 

.584 

School honesty 

490 

Inhibition (3 tests) 

•449 

Persistence total 

442 

Burdick 

42s 

' Intelligence 

391 

Resistance to suggestion 

.323 

Mental age 

•317 

Emotional stability 

■31s 

Self-functionmg 

.314 

' Service tests 


Kits + Envelopes 

.560 

Kits 

37S 

Money vote 

.274 

Cooperation with class 

.129 

Free choice 

.09s 

Honesty tests 


lER (answer sheets) 

■41 S 

Coordination (peeping) 

41s 

Athletic contest 

•34S 

Speed (adding answers) 

.276 

Inhibition 


Total of 4 tests 

.384 

Picture inhibition 

.142 

Persistence 


Cross and square 

.282 

Persistence for self gnd class 

239 

Story resistance 

.022 

Suns’ score card 

.140 

I Chronological age 

— 074 


• From Hartshorne, H , May, M A , and Shuttleworth, F K Studies in the 
Orgamzatton of Character By permission of The Macmillan Company, pub- 
ishers. 
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Table 9 Q 

Correlation or General Integration avith Miscellaneous Measures* 
(from Ilartshornc, May, and Shuttlcworth) 


CONDUCT 

KNOWLEDGE 








Satar 

rested r' 


Ran/ r 

rested r ’ 

School honesty 

354 

61 

Good citizenship 

504 

87 

Service total 

179 

38 

Information 

315 

53 

Inhibition total ' 

296 

74 

Opinion A B 

340 

58 

Pcibistence total 

1 

066 

— 13 




REPUTATION 

ABILITY AND STATUS 



Cof- ' 





Raui r 

rested r * j 


Raul r 

rested r ’ 

Teachers’ marks 

361 

i 

CAVI intelligence 



Deportment 

232 

j 

(sigma) 

120 

.20 

Conduct record 

3 S 2 

i 

Resistance to sug- 



‘Guess who” 

223 


gestion 

246 

49 

Total reputation 

400 

70 

Emotion stability 

19*4 

37 




Sclf-functiomng 

289 

SI 




Age 

-041 

— 06 




Sims (socio-eco- 






nomic) 

138 

24 




Hurdick (culture) 

184 

33 


'Omitting the Picture inhibition test 
'Corrected for attenuation 


the more desirable the qualities a person has in general, the 
more consistent he is, and the more undesirable the qualities a 
person has in general, the more inconsistent he is. The man who 
is generally honest is also consistently honest — that is to say, 
he is dependably honest. But the man who is generally dishonest 
IS inconsistent — he may be honest in one situation and dishonest 
in another. The man with good character seems to be organized 
from within; at least, he acts uniformly and consistently with 
regard to commendable things. But the man with low character 
is more a creature of chance and impulse and is blown about by 
every change of circumstance. 

• From Hartshorne, H , May, M. A., and Shuttleworth, F K , Studies in the 
Organization of Character. By permission of the Macmillan Company, pub- 
lishers 
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The upshot of this finding is that a measure of the character 
of a person whose character is high is valuable, for since the 
person’s conduct is consistent, one can tell what he will do rela- 
tively accurately in specific situations. A knowledge that a man’s 
character is high would be of great importance in employing a 
bank teller, for it would be as good a guarantee as could be 
obtained that the person is dependable and tends to be more 
consistently honest, conscientious, thorough, accurate, and the 


hi^h 

conduct 

feuej 


htv hiph 

conduct m 
•s/fec/Y/c 
situatfons 



like in most situations in which he finds himself than the average 
man. On the other hand, knowledge that a man’s character is 
low tells us more than anything else that he is not dependable 
and that one can pi edict little about his conduct in specific situa- 
tions. He might be honest, and he might not. Probably he would 
tend to adopt the group mores, but would yield somewhat readily 
to temptation. 

In summarizing Hartshorne, May, and Shuttleworth’s results. 
It appears that the two methods best adapted for estimating a 
person’s conduct are ratings and tests of conduct knowledge and 
opinion Performance tests would also be useful if they were less 
awkward and expensive, both in time and in money. 

That ratings and paper-and-pencil tests of knowledge and 
opinion should finally emerge as the most significant indices of 
character may seem something of an anomaly. After the World 

%. ^ J 
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War, rating methods were so harshly criticized that it was widely 
pioclaimed that they were forever discredited However unre- 
liable they may be, and however much they may be biased by 
halos of general impression, they emerge as one of the best 
methods of character diagnosis To be suie, since the war there 
have been several improvements in rating techniques, the most 
important of which were developed by the Character Education 
Inquiry itself It is significant that ordinary school marks operate 
as one of the most effective single measures of character 

It may also strike the casual obseiver as strange that tests of 
knowledge and opinion should finally precipitate as a superior 
measure of character when it has been demonstrated that there 
IS so little relationship between knowledge of conduct and conduct 
Itself However, we must remember that character is only an 
average and that the one thing that perhaps helps most to 
determine the average level of conduct and also the integration 
of character, is one’s verbal organization. The evidence strongly 
supports the conclusion that character is integrated largely 
through verbal organization The person who knows what is best 
and right to do tends on the whole to be the one who does the 
best thing and the right thing He who does not know what 
IS best to do is impelled more by force of circumstances, the 
mores, and by habit. To. know that a man scores high on 
a moral knowledge test usually tells much concerning his char- 
acter and conduct, to know that a man scores low tells us 
little. 

One must agree with Hartshorne, May, and Shuttleworth that 
to get the best estimate of character, a battery or composite of 
tests should be used These workers propose two such possible 
batteries, one a group of ten tests including the “guess who,” 
check list, teachers’ marks, deportment, opinion A and opinion B, 
kits and pictures, coordination, inhibition (three tests), per- 
sistence total, and Burdick culture This group of ten tests cor- 
relates 750 with their “character portraits.” A larger battery of 
twenty-three measures correlated 721 with the criterion, which 
was raised to 81 by correction for attenuation However, the 
composite of ratings obtained from both teachers and pupils cor- 
relates 80 with the criterion, after attenuation correction Theo- 
retically, a large battery of tests is necessary for the diagnosis 
of character; practically, almost as satisfactory results can be 
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obtained simply with ratings and tests of knowledge and opinion 
about conduct 

No such elaboiate investigation is at hand to piovide authori- 
tative knowledge concerning personality and its measurement. 
If by peisonality is meant an all-round survey of an individual, 
then a wide variety of tests should be employed. A complete 
physical and medical examination indicates the adequacy with 
which the bodily machinery is functioning Intelligence tests 
should be used as measures of general level of ability. Question- 
naires can be used to determine interests over a wide variety of 
human activities Questionnaires may also be employed for a 
psychoneurotic survey to determine adequacy of adjustments, 
and special tendencies such as introversion-extroversion, ascend- 
ance-submission, etc Tests and questionnaires enable one to tap 
a wide variety of attitudes and opinions towaid moral problems, 
and personal, social, and economic issues Performance tests give 
evidence as to conduct in carefully standaidized situations. The 
nature of emotional reactions may be discovered in part by cer- 
tain of these questionnaiies, by physiological measures, and 
by the free association experiment Finally, a more detailed 
investigation of personal problems and adjustments to them 
can be undertaken by means of the interview and psycho- 
analysis In short, for a complete picture of the personality 
one would have to use the majority of the techniques described 
in this book as well as measures of physique and intelli- 
gence 

If personality is defined more narrowly, as it usually is by 
psychiatrists, so that it refers to the adequacy of personal and 
social reactions and adjustments, then the inquiry can be limited 
somewhat Intelligence and aptitude tests will help define the 
geneial level of ability and special talents or defects Peiformance 
tests can be used to obtain accurate information as to habits of 
persistence, inhibition, concentration, honesty, etc Observation 
may be used to determine certain behavior characteristics cither 
when among groups as in a class-room or singly while in an 
interview. Probably the greatest usefulness will be found in 
ratings, the questionnaire, and the interview for obtaining evi- 
dence as to adjustments toward the environment, personal evalu- 
ation, attitudes toward reality, sexual relationships, morals, and 
feelings. Finally, probing for dissociated complexes requires a 
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more painstaking technique which involves free association and 
psychoanalysis* 

What of the futuie? As this leview of research on the tech- 
niques for the diagnosis of conduct is finished, it appears that 
psychology and sociology are on the brink of a period of in- 
tensive research directed toward the development and refinement 
of these methods, the foundations for which were laid years ago 
in patient theoretical research in the laboratory The last decade 
has seen a beginning made towaid using these techniques in the 
practical affaiis of life. In the yeais immediately to follow one 
can confidently piophesy that intense and devoted research will 
greatly extend and refine the practical possibilities of these 
techniques. 

What are the directions that this lesearch will take^ 

I. There will be a movement toward standardization. Instru- 
ments now available will be widely used, so that norms can be 
derived and the instruments can be accurately used to measure 
amounts of deviation One must look to the interview, free asso- 
ciation, and psychoanalysis as the most fertile fields for producing 
the suggestions and ideas that will be appropriated by enterpris- 
ing and gifted social scientists for elaboration and experimenta- 
tion, with a view to eventual standardization. Ceitainly there 
will be a tendency toward the development of group measures 

2 There will be fundamental research concerning the reliability 
of the various techniques. The conditions affecting the results 
of each method will receive patient investigation In particular, 
there is pressing need for more light on the conditions in which 
one may expect honesty or dishonesty, the effect of informing 
the subject of the nature of the test under different situations, 
and the outcomes of involuntary diagnosis There will be studies 
made of the adequacy of the results when obtained from indi- 
viduals of different characteristics, temperaments, types of ad- 
j’ustment, and the like. Fundamental research will also be under- 
taken in methods of scoring, recording results, and interpreting 
results. 

j. One may expect a vast amount of research on relationships 

• An excellent Guide to the Descriptive Study 0/ the Personality with Special 
Reference to Use in Psychiatric Cases has been prepared by G S Amsden 
and IS published by the Bloomingdale Hospital Press, White Plains, New York 

(1924)- 
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between variables similar to the work already accomplished by 
the Character Education Inquiry. There is reason to believe that 
this study of relationships will pry into eveiy corner of the field 
and test out all possible relationships under various conditions. 
Only by such patient inquiry will it be possible to reach funda- 
mental conclusions as to the significance and validity of the vari- 
ous proposed measures 

4. This study of relationships will lead to exhaustive studies 
of the Items to be included in tests, questionnaires, and inter- 
views. At present there is a feeling that in an interview a rather 
comprehensive survey is required in order to avoid leaving out 
any important item. But with the accumulation of relationships 
It will be possible to draw up schedules of the necessary items 
on which information must be obtained in order to diagnose 
problems in various correlated fields. 

5. The writer believes that the development of various types 
of questionnaires and rating methods promises to yield the re- 
sults which will be of most value in the immediate future. Because 
of the significance of language in the control of conduct, particu- 
larly the higher levels of conduct, it is probable that methods 
employing language such as the questionnaire will prove of fun- 
damental value. Measures of the environment and of objective 
observation techniques also bid fair to reward effort expended 
upon them. Performance tests are fundamental but exceedingly 
costly to administer. 

The movement toward the investigation and development of 
techniques for the diagnosis of conduct, now reaching larger pro- 
portions every day, promises much for the better control of^ 
human affairs. ' 
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Educational age, 333, 334 
Effect Cause and. Test, 271, 277, 285- 
288, 290 

halo, 107, 111-113 

Effort in school, measures of, 333-336 
Ekdaiil, a G , 375 
Elasticity of blood-vessels, factor m 
blood pressure, 411 
Electra complex, 489 
Electrical 

current, see Current 
resistance, 423 427 
Electrode 424, 425, 427 

security of contact of, factor in 
psychogalvanic reflex, 427 
temperature of, factor in psycho- 
galvanic reflex, 428 
Emergency behavior, 400 
Emotion, 376-379, 382, 384, 421, 429- 
431, 436, 488, 489, 496, 498, 517 
in answering questionnaire, 139 
variations with, 

of blood pressure, 413 
of breathing, 419 


Emotion. 

vaii.itiuns with (continued) 

of ps) chogalvanic reflex, 429 
of pulse, 406 
Emotional 

complex. 432, 433 
excitability, 437, 438, 440 
instability, 191 
maladjustment, 194 
maturity 193 194 • 
reactions in the interview, 467, 468 
stability, 562-564 
states, 393 
susceptibility, 437 
tension, 394 
Emotionality, 187, 192 
score, 191 
Emotions 

Group Scale for Investigating 
(Presscy), 189, 192 
physiological measures of, 12, 400- 
4+9 367 
use of, 430-437 

Employee, 42 

Energy of heart, factor in blood pres- 
sure, 411 413 

Eng, H, 407, 41S-417, 420 
Envelopes Test, 326, 562, 563 
Environment, 13 

description of, I2, 13 
home, measure of, 146 
measures of 12, 536-554, 569 
Questionnaire on Cultural and Me- 
chanical, 553 
questions of, 146 
Epileptics, 387 

Equipment of the living-room, measur- 
ing, 546. S+7 
Erlanger, J , 410 
Erogeneous zones, 487 
Error 

average in rating, 48-50 
direction of, m rating, 48-50 
in observation, 24, 498 
of accuracy m testimony, 475, 
498 

of interpretation in psychoanaly- 
sis, 498 

of interpretation in testimony, 47s- 
477 

of memory, 498 

of omission in testimony, 475 
Escape, 471 

Essentials to observation, 24-30 
Estimates, 26 

of ability, 478 
of character 521, 522, 566 
Ethical 

Discrimination Test, 260, 267 
Perceptiou Test, 268-279 
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Ethical (continued) 

-Social Vocabulary Test, Schwes'- 
ingcr, 270 285, 286, 288 
standards. 290, 291 
Etiquette, 1552 

Evaluation of psychoanalytic tech- 
nique, 498-501 
Evans, A L, 519, 520 
Evasions in the interview, 468 
Eierett, E M, 187 
h.vidence, types of, 12 
Examination 
health, 556 

Illinois Intelligence, 333 
medical, 567 
oral 478 
physical, 567 
psychiatric, 17 

Excitability, emotional, 437, 438, 440 
Excitement, 475 

freedom from, essential in obser- 
vation, 30 
Executives, 253 
Exercise 

breathing with, 419 
variations of pulse with, 405, 406 
Experience, 391 
clinical, 6 

variables record, 145, 146, 181, 
1S2, 185 
Experimental 

work in free association, 366 
Experimentalist, 7 
Expiration-inspiration ratio, 435 
Expression 

extreme, 141 
facial, 13 

Exsomatic current, 423 
External signs of conduct, 12, 13, 503- 
S3S 

Extra-curricular activities, il 
Extreme 

opinions, 233, 234 
ratings, 103, 114 
Extrinsic associations, 368 
Extroversion, see Introversion 
Extrovert, 170, 171, 201, 234 
Eye-control, test of, 331, 332 
Eyes, 24, 25 


Fables, comprehension of, tests, 267, 
268 

Facial profile, 504, 519, 520 
Fact 

of crime, diagnosis of, 8 
score, 306, 31 1 

Factor, acquaintance, in rating, 108, 
109 

generosity, in rating, 96-98 


Factors influencing 

acidity of saliva, 439 
acidity of urine, 439 
blood pressure, 411-413 
blood tolume, 416 
creatinine coiicenti ation of urine, 
439. 440 

psychogalvanic reflex, 426-429 
rate and amplitude of breathing, 
418-420 

reliability of rating, 98-108 
response in free association, 371- 
373 

Facts, questionnaires to obtain, 123, 
123-138 
Faculties, 505 
Faculty psychology, 22 
Fair-nimdedness, measurement of, 161, 
216, 217 

Fallibility of observation, perception, 
inference, judgment, 15, 16 
Fahb, C B, 512 

F XTERSON, H F , 509 
Fatigue, 27, 371, 376, 517 

factor in blood pressure 412 
factor in psychogalvanic reflex, 
428 
Fear, 492 

distraction tests, 332 
Ffciiner, G T, 52 
Feeble-minded, 28, 380 
children, 388 
Feeling, 223, 224 

of difference score, 74, 75 
Feelings of infeiiority, 493 
Feincold, G a , 334 
FtLEKY, A M , 435 
Fere, C, 420, 421 
Fere’s phenomena, 421 
FcRENcri, S , 486 

Fernald G , 268, 279, 287, 322, 323 

Fifteen Puzzle Test, 307 

Filfr, H a , 66 

Filter. R O , 329, 331 

Finality of judgment, 341, 343, 346 

Finding observation, 30, 31, 38, 39 

Five Rules Test, 272 

Fixation, 487, 489 

Flexibility, 340, 343, 346 

Fluidity of response, 339 

Folin, O , 438 

Folin colorimetric method, 438, 439 
Follow-up letters, 129, 130 
Food, factor in treatinine concentra- 
tion of urine, 439 
Forcefulness, 339 
Foresights Test, 273, 274, 277 
Formal titration method, 438 
Forms, tabulation, 131 
FsAi^tuN, E E, 239-241 
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Franzen, R, 74. 270, 272, 277, 310, 
313- 114 

Free association, 12, 14, 361-399, 430, 
431. 48s, 495 496 498-SOOi 567. 
568 

and insanity, 383-387 
as measure of ability, 387-389 
as measure of interest 389-391 
classification of responses in, 368- 

371, 388 

reliability of, 372 
definition in, 388 
experimental tiork in, 366 
factors conditioning the response 

m. 371-373 

instruetions for, 365 
key for scoring, 390 
list, group method of giting 364 
List, Jung, 363, 364 
List, Kent-Rosanoff, 362 363, 377, 
381. 385, 386 388 390 
qualitative analysis of, 384 
recording time in, 364 
Freedom 

from excitement, essential in ob- 
servation, 30 

from habits of interpretation, es- 
sential in observation, 29 
fiom load, 339, 340, 343 346, 347 
from pathological states, essential 
in observ ation 27 
from prejudice essential m ob- 
servation, 29, 31 
Freeman, F N , 337 
Freudian, 490, 491, 498 
Freud, S , 144, 14s, 486-488, 492-495, 
497-500 

Freyd, M , 65, 66, 86, 147, 159, 183, 
197, 242, 246-248, 252, 350, 390 
Fryer, D , 239, 241-244 540, 541 
Functional mental disorder, 431 
Furffy, P If , 95 

Furnishings of the home, inventory of, 

546 

Furst, E , 362 


Gall, F J , 504, 505 
Gallup, G H , 94 
Galton, F , 52, 78, 361, 362, 368 
Galvanometer, 421-424, 427, 430, 431 
deflections, 376, 377 
Games, 313 

Garretson, 0 K , 250, 251, 253, 255 
Garretson Interest Questionnaire, 250, 
251 

reliability of, 252 253 
validity of, 253-256 
Garrett, H E , 93, 187, 193, 508 
Gates, A. 1., 263 „ 


Gates-Strang Test of Health Knowl- 
edge, 263 

Gage, ancioid, 409 
General trait, 107 
Generalization, 7 
Gcneialirations Test, 217, 218 
Generosity factor in rating, 96-98 
Gesture, 473 
Gibson, S M , 329 
Gifted children, 243, 389, 545 
California surtcy of, 104, 389 
Gilbert, J A , 3 19 
Gildemeister, M , 43 
Giles, J T , 266 
Gilliland A R, 331, 332 
Glands, 510 

adrenal, 440 
pituitary, 440 
sweat, 400, 421, 425-427 
Glandular secretion, 440 
Glasses, 25 

Godefroy, j C L , 423 

Goethe, J W , 525 

Goodenough, F , 33, 36 

Good judge charai.i.eristics of, 100, loi 

Goring, C, 515, 516 

Gowim, E B , 506 

Graphic rating 

method, advantages of 65 
scale, 45, 62-72, 76, 77, IS 9 , 180, 
196, 197 
reliability of, 95 
rules for making, 65, 66 
Graphology, 525-528 
Gravity, factor in blood pressure, 411 
Gray, \V S , 459 
Griffiths, C H , 456 
Grip, 379 
Group 

conduct, 292, 293 
method of giving free association 
list, 364 

reactions, 372, 374 
Scale for Investigating the Emo- 
tions (Pressey), 189, 192 
Growth, observation of 34, 38 
Guess Who Test, 73, 74, 563, 564, 

S66 

reliability of, 73, 74 
Guidance, 6, 14, 15 
vocational, 516 
Guilford, J P , 193 
Guilt, 394 

detection of, 361, 382-385 
test of, 382-385 

Gullette, R , 187, 192, 377, 434 
Gundlach, R 'H , 303, 509, 512, 513 
Guthrie, E R , 165, 199, zoi, 202, 
312 

Gutzmann, H, 393 
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Habit, 292, 560 
lists, 31 
Habits, 

consistency of, 35 
nervous, 31, 33 

of interpretation, freedom from 
essential in observation, 29 
Haccertt, M E , 68 
Haines, T H , 268, 289 
Halo effect, 107, 111-I13 
Hamilton, G V , 4.54 
Hand 

dy nanometer, 313 
shape of, 504, 524 

Handwriting, 13, 338-342, 504, 525- 
529 

and dishonesty, 326 
and sex, 326, 329 
Hanna, J V , 107 
Hanson, W L , 266 
Harper, M H , 167, 232, 233 
Hart, H N , 110, 141, 224-227 
Hartman, D A , 228 
Hartman, R , 242, 243 
Hartsiiorne, H, 3, 20, 72, 73, 113, 
130, 153-156, 270, 271. 283-293, 
298, 299, 302-307, 3 10-3 14, 316, 
317. 325, 327, 353 , 354 , 521, 
558 , 561-566 
Hayes, H , 461 
Hayes, M S , 94 
Head, measurements, 518, 319, 529 
Health 

Education Tests, 270, 277 
examination, 336 

Knowledge, Gates-Strang Test of, 
263 

Healy, W , 268, 287, 289, 474 

Hearing, 412 

Heart-beat 

measurement of amplitude of, 
407 

measurement of rate of, 403-407 
Heart, energy of, factor in blood pres- 
sure, 411,413 

Heidbreder, E , 147, 165, 170, 196, 
197, 200-202, 204, 508, 509 
Heidbreder inferiority attitude self- 
rating scale, 309 
Height, 504, 306, 507, 528, 529 
Heinlein, C P, 28 
Helpfulness, 353 
tests, 302 

Henriques-Sorenson method, 438 
Hepner, T W , 72 
Herskowitz, M j , 348 
Hesketh, F E, 511, 312 
Hightower, P R , 289 
History, case, 494 
Histrionic associations, 368 


Hoffman, G J, 110, hi 
Hoitsma, R K , 164, 163, 1R5, 187, 
201, 202 

Holley, C E , 548 

Hollerith tabulating machine, 131, 
132 

Hollingworth, H L, 21, 99, 100, loi, 
103-107, 109, HO, 112, 174, 186, 
187, 478, 499, 521 

Home 

background, 336, 330, 332 
Conditions, Whittier Scale for 
Grading, 348, 549 
environment, measure of, 146 
furnishings, inventory of, 546 
measures of, 146, 346-349 
neatness of, 348 
necessities of, 348 
Sims Score Card for Measuring, 
5 + 7 . 548 
size of, 348 

socio-economic level of the, 536, 
546 

Test, 313 

Honesty, 18, 361, 364, 568 
measures of, 113, 333 
school, 362-564 
tests, 130 302-318, 362-364 
Honor Society, National, 12 
House, S D, 145, 146, 164, 180, 181, 
183, 183 186, 188 

House, Woodworth-, Mental Hygiene 
Inventory, 164, 180 
Howell, H W , 414 
Hudbard, L M , 38J 
Hubbard, R M , 232, 233 
Huciirs, W H, 85, 94, 99 
Hull, C L , 91, 92, 323, 364, 380, 
381, 319-521, 524, 525, 527 
Humor, sense of, 457 
Hunger, 486 
Hurlock, E B , 109 
Hydrogcn-ion concentration, 437 
Hygiene Mental, see Mental 
Hyper-thyroidism, 307 
Hysteria 180, 431 
Hysteroid, 180 


Idea of right, deviation from accepted, 
score, 74, 75 

Ideal, Self-Ordinary, Rating, 74-76 
Identification, 490 
Idiosyncrasy score, 161, 191-193 
Idiot, 388 
I E R tests, 563 
Illinois Intelligence Examination, 333 
Illusion, size-weight, 318, 319 
Imagery, 368 

locale of Mental, 53 
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Imbecile, 388 
Importance, 93 

social, basis for selecting items in 
observation, 31 

Improbable acliiev'ement technique, 
306-308 
Improv cment 

111 rating, 114 
prognosis of, 9, 10 

Impulses, coordination of, 34.2, 34.3, 

346 

Impulsion, motor, 340, 343, 346 
Inaccuracy in perception, 29 
Incomplete 

rankings, combining, 92. 93 
ratings, combining, 92, 93 
Incorrigibility, 307 
Incriminating questions, 137, 138 
Independent ratings, 46, 95, 96 
Index 

Lie, ISO, 151, 3 IS 
Morphologic, 187, 193, 204, S04, 
S07-S09 

Studiousness, 334-336 
Truthfulness, 31S 

Indiana Survey of Religious Education, 
302 

Indicators, complex, 496 
Individual, 

comprehensive study of, 555-569 
psychology, 492 
questionnaire to study, 125 
rating one, at a time, 80, 81 
reactions, 370, 371, 385, 391 
Industrial classification of occupations, 
537 

Industry, 335 
Infantile reactions, 145 
Inference 

fallibility of, 15 
Test, 220-222 
Inferiority, 492, 493 

Attitude, Self-rating Scale Heid- 
breder, 509 
feeling of, 493 
score, 74, 7S 
Information 
social, 235 

Test, 311, 312, 562, 563 
Laycock Biblical, 266 
Inhibition, 353 

motor, 341, 343, 346, 347 
tests, 302, 327, 328, 562-564, 

566 

mtercorrelations of, 327 

Initial 

contact in interview, 459 
interview, 474 
polarization, 425 

Inner associations, 368, 374 „ 


Inquiry, Character Education, see 
Character 

questionnaire used in original, 125 
statistical, 6, 18 
Insanity, 28 361, 387, 394 

and association method, 385-387 
causes of, lo 
diagnosis of, 8, 9, 10, 17 
kind of, 10 

manic-depressive, 387, 511, 512 
Insight, social, score, 74, 75 
Inspiration-expiration ratio, 435 
Instability, emotional, 191 
Instructions for free association, 365 
Integration, 562 

of character, 566 
of personality, 377 
Intellectual interest, 389, 390 
Intelligence, 168, 195, 233, 244, 288. 
321, 329, 387, 522, 529, s6i- 

563 

Examination, Illinois, 333 
occupational standards, 540, 541 
Scale 

Binet-Simon, 15, 267, 268, 351 
Pearson’s for judging. 53, 54 
tests, II, 17, 567 

American Council, 187, 202, 
S08 

Army alpha, 539-541 
C A V D, 423 
C A V I, 564 
Disguised, Snedden’s, 454 
National, 287 

Thorndike, for High School 
Graduates, 187, 255, 324, 
328, 508 

Intelligent, 372, 387, 475 
reaction time of, 37s 
Interchurch World Movement, 302 
Intercorrelations 

of deceptive behavior, 317 
of Downey - Will - Temperament 
Tests, 345, 346, 351 
of inhibition tests, 327 
of measures of deceptive behavior, 
317 

of measures of speed of decision, 
330 

of moral knowledge tests, 286 
of persistence tests, 325 
of service tests, 326 
Interest, It, 170 

activity, 389, 390 
and ability, 239, 242-245 
Blank Vocational, 169, 250, 253 
distribution of commercial, 251 
free association as measure of, 389- 

391 

m detail, 341-343, 346 
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Interest {contmvrd) 

intellectual, 389, 390 
measurement of, 245-251 
medical, scale 248, 249 
permanency of, 239-242 
Questionnaire, Garretson, se^ Gar- 
retson 

questionnaires, 12, 239-259, 567 
Report Blank, 166, 248 
social, 389, 390 
Test, 224, 225 

Interpretation, 16, 17, 28-30 
errors of, see Errors 
freedom from habits of, essential 
m observation, 29 
of dreams, 500 
Interrogation, 391 
Interval, time, m observation, 33 
Interview, 6, 7, 12-15, 18, 450-485. 55^, 
567, 568 

accuracy of, 453, 454, 475-479 
advantages and disadvantages of, 

451-454 

agreement in, 466 
appointment in, 459 
choice of questions in, 471, 472 
clima,t and denouement, 460, 473, 
474 ^ ^ 

compared with questionnaire, 451- 

conclusion in, 460, 474 
conditions of, 459, 460 
denouement in, 460, 473, 474 
diagnostic, 450, 451 
diagnostic summary in the, 480 
emotional reactions in, 467, 468 
evasions in, 468 
initial contact in, 459, 474 
introduction in, 460-462 
involuntary, 14, 457, 458 
leads in, 452 
motivation in the, 464 
non-verbal elements in, 472, 473 
note-taking in, 479 
observation in, 37 
office, 459 

physical contact in, 467 
preparation for, 458-460 
preview m, 458 
privacy m, 459 
procedures in, 459-474 
process, 479, 480 
questions in, 469-472 
recording in, 479, 480 
reliability of, 475-499 
research, 450, 451, 458 
resistance in, 458, 464-468 
rising action in, 460, 462-473 
schedule, 454, 461 
single, 474 , 


Interview {continued) 

standardization of, 6, 454, 455 
steps 111, 459-474 
suggestion in, 452 
tension in, 464-468 
tieatment, 451 
voluntary, 457, 458 
Interviewer, 15, 27, 455-457 
training of, 457 
Interviews, types of, 450, 451 
Iiitiinsic associations, 368, 369 
Introduction in interviewing, 460-462 
Introspection, 394 

of children, 143, 144 
Introversion, 21, 22 

characteristics of, 197-199 
-extroversion, 567 

distribution of, 200 
occupational differences in, 
202, 203 

performance tests of, 200 
questionnaire to measure, 165, 
166, 195-205 
reliability of, 201 
validity of, 202-205 
set differences in, 204 
Introvert, 170, 171, 201, 234 
Inventory 

Bern renter Personality, 208 
Colgate Personal (Laird), 158, 
165, 179, 180, 196, 201, 202 
of home furnishings, 546 
Psychoneurotic, see Psychoneurotic 
Thui stone Neurotic, zo8 
Woodworth Psychoncurotic — see 
Woodworth 

Woodworth-House Mental Hygi- 
ene, 164, 180 
Involuntary 

diagnosis, 14, 15, 568 
interview, 14, 457, 458 
I Q, 103, 187, 235, 243, 287, 351, 
562 

Item 

answering every, 141, 142 
m a rating scale, should be ob- 
servable, 84 
multiple-choice, 140 
occuircnce of, in observation, 33 
selection, based on a social impor- 
tance, 31 

based on social selection, 147, 
148 

by analysis, 147, 148 
empirical method of, 

for observation, 31, 32 
for questionnaire, 147-149 
to be observed, description of, 32 
true-false, 140 
validation by rating, 147 
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Item {contmued) 
yes-no, 140 
number of, m a rating scale, 83 
weighting, see Weighting 


JtcoBSEN, C, 187, 192, 377 
Japanese Cross Test, 324, 563 
JtFrRESs, L A , 42s, 426 
Johnson, B , 179 
Johnson, I 375 
Jones, E 368 
JoNLS E b, 167, 233, 235 
Judge characteristics of good, too, 101 
differences in ability to, 99-104 
Judging i.itclligencc, scale for, 53, 54 
Judgment, 79, 27 

consensus of in determining scor- 
ing key, 157 158 
fallibility of, IS 16 
finality of, 341, 343. 34<5 
in rating 31 

made anab tical by ratings, 43 
made representatiie by rating, 43 
moral, tests, sce Moral 
tests of, 12 260-297 

reliabilitt of, 284 285 
validity of, 285-294 
variability of, 102 

Judgments, order-of-mcrit, combining, 

86, 87 

Judicial capacity, 99, 100, 103 
Jump, standing broad, 313 
Jung, C G 147 196, 361-366, 368, 
370-376, 379-381. 387 391 430, 

486 

Jung Free Association List, 363, 364 


K VKisE, H , 394 
Kvlmus, If T, 428 
Kelley, T L , 88, 91, 159, 162, 244, 
248, 282, 302, 335, 336, 388, 

389, SSI 

Kelley-Wood Table of the Normal 
Probability Integral, 88, 91, 551 
Kellogg, W N , 187, 508 
Kent, G H 361-363, 370, 371, 374, 
377- 381, 383-388, 390 
Kent-Rosanoff Free Association List, 
362, 363, 377, 381, 38s. 386, 
388, 390 
Key, see Scoring 
Kikdeil, V S . 110 
Kinetic Will Test 322, 323 
King, L H 244 
Kincseutt F, a . 47 
Kits Test 326. 562, 563, 566 
Kn SON. H D , 506 
J-. 392, 393 


Knee-Jerk, 379 

Knight, F B , 43, 61, 74, 77, 97, 98, 
108 iiz, 113, 310, 517-519 
Know ledge 

and judgment, tests of, see Judg- 
ment 

and opinion, see Opinion 
biblical, tests of, 264-267 
Health, Gates-Strang, Test of, 263 
moral, and age, 288 

and conduct, 289, 290 
moral, tests, see hloral 
of conduct, tests of, 12, 260-297, 

S63-567 

intercorrelations of, 286 
reliabilit) of, 284 285 
validity of, 285-294 
Test, Adv anced Bible 266 
Word, Thorndike Tests of, 261 
Koiis, C H, 380, 391, 394 
Kohs, S C, 260, 267 
Koll, a G , 410 

Kornhauser, a W, 94 95, 529 548 
Krvmer, F , 391 
Kr.vpelin, E , 361, 368 
Kretschmtr, E 504,507 509-513.514 
Kretschmer's types 504, 509-514 
Kymograph, 392, 402-404, 410, 416-418 


Labor, factor in blood pressure, 413 
Lactic acid, 412 

Laird, D A, 158, 159, 165. 171, 179, 
180, 183, 196, 197, 201-Z04, 208 
Landis, C 51, 98, 100, 187, 192, 377 
420, 425, 428 430 432, 434, 435 
Lvnofeld, H S , 375 382 
Langlie, T a , 244 
Language and conduct. 18 
Larson, J A , 407, 432, 434 
Lashley, K S , 505 
Laslett, H R , 390 
Lay cock, S R , 266 
Laycock Test of Biblical Information, 
z66 

Le.vch, H M, 383 

Leadership I2, 204, 506 307, 509, 529 
Leading questions, 135-137, 471 
Leads in intervievr, 452 
Lehmann plethysmograph, 415, 416 
Leibnitz, G W , 525 
Length of observation period, 37, 38 
Lentz, T F , Jr , 147, 193 
Leptosome type, 513 
Letter, follow-up, 129, 130 
preliminary, 130 

Level, see Socio-economic, Occupation 

Lever arm, 403 

Libido, 486 

Lie detector, 43^ 
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Lie Index, 150, 151, 31S 
Lindbergh, C A , so 
Lindsay, E E , 99, 129 
Lipmann, O , 392 
List, check, see Check 

free association, see Free association 
habit, 31 

Thorndike Word, 261, 364 
Living-room, measuring the equipment 
of, 546, S47 

Load, freedom from, 339, 340, 343, 346, 
347 

Loubroso, C, 432, 504, 314-316 
Long, J A , 160 
Loomis, A M , 32 
Loss of reliability, 79 
Lowe, G M , 268, 289 
Lowell, F , 372 
Ludgate, K , 322, 323 
Ludwig, K F W , 402 
Lugoff, L S , 364, 380, 381 
Lying, 303, 432, 433 
test of, 382-383 
tests of, 313 


MacAlister, a , 304, 305 
MacLaurih, D 0 , 324 
Macrosplanchnic, 307 
Magic 

number square, 324 
Square Test, 314 
word square, 324 
Maladjustment, emotional, I94 
Maller, J B , 298, 325, 327 
Man, Mystery, Test, 314 
Maniac, 393 

Manic-depressive insanity, 387, 511, 312 
Manipulation Test, 327 
Manners, Good, Orr Test, 264, 270, 
286, 332 

Manometer, 408, 409 
Manson, G E, 104, no, 328 
Mantegazza, P, 32 
Man-to-man rating, 37-61 
reliability of, 93 
Manual, rater’s, 47, 48 
Marbe, K , 394 
Marey, E j , 403, 404, 417. 418 
Marks 

of deportment, 363, 364, 366 
school, 79, 478, 362-364, 366 
Marston, L R , 163, 166, 196, 199- 
201, 204 

Marston, W M , 367, 407, 432-434 
Martin, H E , 372 
Mateer, F , 386 

Mathews, E , 164, 178, 183-187, 193 
Mathews, Woodworth-, questionnaire, 

193 


Maturity, emotional, 193, 194 
Maitisby, W S, 461 
May, M a, 3, 20, 72, 73, 113, 130, 
133-136 270, 271, 283-293, 298, 
299, 302-307. 310-314, 316, 317, 
32s, 327, 337, 349, 333. 3S4, 
338, 361-566 
Maze Test, 306, 307 
McCabe, F E , 321 
McCall, W A , 160, 233, 433 
McCall-l/mg method of weighting 
Items, 160 

McCurdy, J T , 174 
McDowell, R J S , 426 
McFadden, j F. 330 
McGeoch, j a , 164, 165, 191 
Measure 

adjustment, questionnaire to, 12, 
174-214 

attitude, questionnaire to, 12, 213- 

238, 367 

interest, questionnaire to, 12, 239- 

239, 567 

mtroversion-extroversion, question- 
naire to, see Introversion 
of ability, free association as, 387- 
389 

of character, rating as, 49 
of citizenship, ratings as, 49 
of equipment of livmg-room, 346, 

svr 

of home environment, 146 
of interest, free association as, 389- 
391 

of mental unbalance, 9 
of prejudice, 161 
Measurement, 19, 450 
observation as, 32-36 
of amplitude of breathmg, 417-420 
of amplitude of heart-beat, 407 
of blood pressure, 407-414 
of blood volume, 414-417 
of conduct, 4, 3, 7, 298 
of fair-mindedness, 161, 216, 217 
of interests, 243-231 
of personality, 360, 361, 367, 368 
of psychogalvanic reflex, 420-429 
of rate of breathing, 417-420 
of rate of heart-beat, 403-407 
scale of, traits as, 30-32 
Measurements, 

biochemical, 437-440 
head, 318, 319, 529 
physiognomic, 318, 520 
Measures 

consistency of, 20, 21 
of ability, 10, 21, 298 
of aggressiveness, 331-333 
of caution, 328, 329 
oLcharacter, 361, 362, 366 
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Measures (conttntied) 

of deception, see ncrcit 

of effort m school, 333-336 

of emotions, sec Emotions 

of environment, IJ, S 36 -SS 4 SoQ 

of home 546-5+9 

of honesty, 113, 353 

of intelligence, see Intelligence 

of occupational level, S 37 -S 4 S 

of physique, 567 

of reactions, 12 

of responses, 536 

of results of conduct, 12 

of social attitude, see Social 

of socio-economic level, 548, 550- 

SS-. 563 , - . 

of speed of decision, see opcca 
of studiousncss, 333-336 
Measuring 

Church School Textbooks, Score 
Caref for, 56, 57 

the Home, Score Card for, 547^ 
548 '■ 

Mechanical Ability Study, Minnesota, 

545 . 553 , „ , , „ 

Mechanical and Cultural Environment, 
Questionnaire on, 553 
Mechanisms, 489-491, 497 

breaking defense, 468, 469 , 
of mental release, 4 i 38 r 
Medical 

examination, 567 
interest scale, 248, 249 
MriER, N C , 348 351 
Melancholic temperament, 393 
MrLTZER, H , 432, 470 
Memories, 146 
Nfemory, error in, 498 
Mendelssohn, M , 426 
Menstruation, factor in blood pressure, 

413 

Mental , 

Ability, Terman Group Test of, 

351 

activity, factor in psychogalvanic 
reflex, 429 

age, 321, 334, 351, 563 
balance, prognosis of, 9 
disease, classification of, 10 
disorder, 431 

Hygiene Test, Colgate, 164, 185, 
1 87, 203 

Hygiene Inventory, Woodworth- 
House, 164, 180 
Imagery, Scale of, 53 
Morgan Group, Test, 351 
release, mechanisms of, 468 
test, see Mental 

unbalance, degree of, measure of, 


Mercury manometer, 409 

Messer. A , 394 | 

Metabolism, 402, 438-440 

Method 

ausculatory, 409 
ausfrage, 394 
aussage, 391 

-association, 391, 392 
case, 555 
combination, 392 
distraction, 393 

empirical, of selecting items, see 
Empirical 

Folin colorimetric, 438, 439 
formal titration, 438 
free association, see Free associa- 
tion 

graphic rating, see Graphic 
Henriques-Sorenson, 438 
of making ratings, 52-76 
of paired comparisons, 54, 55 , 7 ° 
of rating cross-on-a-line, 62 
of weighting Items, McCall-Long, 
160 

ordcr-of-merit, 55, 76 
perception, 392, 393 
reproduction, 391 
scientific, 23 
sentence, 393, 394 
Methods 

of scoring questionnaires, 156-161 
rating, see Rating 
statistical, 6, 7, 20 
testing, 6, 139-142 
M I see Morphologic Index 
Microsplanchnic, 507 
Miller, G F, 315 
Mimeographing, 132 
Miner, J B , 85, 94, 106 
Minnesota, 

College Ability Tests, 508 
Mechanical Ability Study, 545, 553 
Mirror galvanometer, 421, 422 

Mohr, G J, 509, Si*. 5 i 3 ^ 

Money Voting Test, 326, 563 
Monroe, R , 459 
Monroe, W S , 333 
Montgomery, R B , 527 
Moore, B V , 242, 245, 246 
Moore, H T., 216, 331 
Moral 

Judgment Test, 221, 222 
judgment tests, 268-279 
knowledge and age, 288 
and conduct, 289, 290 
knowledge tests, 285-294, 302, 563, 

S66, 567 

intercorrelations, 286 
reliability of, 284, 285 
validity of, 285-294 
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Mora-vesik, M H , 393, 428 
'.Morgan, J J 8,323,351 
organ Group Mental Test, 351 
orphologic Index, 187, 193, 204, 504, 

507-509 

Morphology, 507 

Motivation in the intervietv, 464 

Motive, 294 

Motor 

impulsion, 340, 343, 346 
inhibition, 341, 343, 346, 347 
Movement 

muscular, factor in psychogalvanic 
reflex 427 

speed of, 339. 343-u6 
Movies, Scale of Attitude toward, 229, 
230 

Moving coil galvanometer, 421, 422 
Mowrer, E R , 463 
Multigraphing, 132 
Multiple-choice item, 140 
MuNSTERnERC, H , 25, I36, 36I, 364, 
382 

Muscio. B , 137 
Muscle-reading, 338 
Muscular 

activity, factor in acidity of urine, 
439 

movement, factor in psychogal- 
vanic reflex, 427 

theory of psychogalvanic phenom- 
enon, 425, 426 
Mystery Man Test, 314 


Naccarati, S , 193, 507, 508 
Napoleon 506 
National Honor Society, 12 
National Intelligence Test, 287 
National Research Council, 174 
Natural 

response, 299 
situations, 299, 300 
Neatness, 528, 529 
of the home, 548 
Negative affective tone, 377 
Negative question, 137 
Negro, 350 
Nervous, 

habits, 31, 35 
sympathetic, system, 400 
Nested Squares Test, 306, 563 
Neurasthenia, 180 
Neurasthenoid, 180 
Neurosis, 488 

Neurotic Inventory, Thurstone, 208 
Newark Survey, 151, 152 
Newcomb, A , 529 ' 

Kxwcomb, T M, 32 
jr-type test, 222 


Nitrogen, amino-acid, 438 
Non-compliance, 341, 343. 346 
Non-verbal elements in the interview, 
472, 473 

vs \crbal diagnosis, 17, 18 
Normo-splanchmc, 507 
Norms, 169, 374 
Norsworthy, N , 98, 104 
Note-taking, 47, 479, 494 

preliminary to rating, 46, 47 
Number 

Checking Test, 311, 312 
of Items in a rating scale, 83 
of observations, 33 
of questions, 132 

of scale-divisions in rating, 78-80 
of social situations, observation of, 
36 

square, 324 

Nonblrg, H, 379, 380 


Objective 

associations, 370 , 

definition in rating, 85 
record, rating should be, 45 
■test, 222, 556 

reliability of, 168 
trait, 106, 107 
ObjecUvit^ m rating, 76 
Observableiii^ns in a rating scale, 84 
Observation, 7, 12, 13, 17, 23-40, 298, 

450, 567. 569 

accuracy of, increased by prac- 
tice,* 27 

as measurement, 32-36 
conditions of, 36-38 
consistency m, 134 
control of situation in, 56, 37 
defining the objects of, 30 
description of item in, 32 
diary in, 31, 38, 39 
directed, 30, gi, 38 
discrimination essential to, 27 
errors of, 24, 498 
essentials to, 24-30 
fallibility of, 15 
finding, 30, 31, 38, 39 
freedom from excitement, essen- 
tial to, 30 

increased by practice, 27 
increasing reliability of rating, 99, 
108 

in the interview, 37 
occurrence of an item in, 33 
of behavior, 17 

of growth or development, 34, 38 
of number of social situations, 36 
of social contact, 36 
of time spent in social situation, 36 
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Obsenation (continued) 

participatinn in the situation in, 17 
period, length of, 37, 38 
preliminary to rating, 46, 47, 114 
record of, 28 
recording, 38, 39 
reliability of, 31, 33-36 
seating plan in classroom, 39 
selection in, 30 
selection of items m, 31, 32 
self-, 18 
svstematic, 49 
tunc intcn.Tl in, 33 
unit of measurement in, 33 
Observations 

number of 33 
spacing of, 33 

Observers, api cement of, 34 
Occupational 

differences, in introversion, 202, 
203 

intelligence standards, 540, 541 
level, measures of, 536-545 
Preference, Questionnaire of, 245, 
246, 248 

Status, Barr Scale of, 542-545 
Occupations 

classifications of, 537-539 
Counts 538 539 
industrial, 537 
Taussig Scale of, 538 
Oedipus complex-, 489 
Offense rating 279-284 

Clark Scale of, 279, 281 
Office interMCW 459 
OliXsjder E 1 10 

Oi-soN fV C 24, 31, 33-36, 38, 68 
Omission, errors of, m testimony, 475 
Opinion 

certainty of, 233, 234 
extreme 233 234 

Public \fatson Surrey of on Some 
Religious and Economic Issues, 
142, 166 216-222 234 235 
tests of 167 562-567 
Opposites Test 270 285, 286, 288 
Opposition resistance to, 341 
Optimism in rating, no 
Oral examination, 478 
Order-of-mcrit 

judgments, combining, 86, 87 
method of rating, 55, 76 
Ordinary-Ideal, Se'f-Rating, 74-76 
Organic 

inferiority, 492 
mental disorder, 431 
Organiration, verbal, 566 
Organs sense, 24 

Original inquiry, questionnaire used in, 
I2S 


O Rourke, L J , 66, 454, 

Orr, C 1 , 263, 264, 552 
Orr Good Manners Test, 2645., 270, 286, 
SS2 N 

Oscillograph, cathode-ray, 424 t 
Otis, A S , 267 \ 

Otis, M, 320, 321 
Outer associations, 368, 374, 376 
Overrating, 74, 109, no 
Overstatement Test, 308-310 


Pain 

factor in blood pressure, 413 
factor in breathing, 419 
Paired comparisons, 

method of 54, 55, 76 
statistical treatment of, 86-89, 93 
Palmer, V M , 463 
Palmistry, 503, 504 
Paper 

and pencil test, 12, 402 
Paraffin Test 303, 304 
sensitive 403 
smoked 402, 403 
Parables tests, 267, 268 
Paraffin Paper Test, 303, 304 
Paranoia, 387 
Parental conditions, 548 
supervision, 548 
Paresis, 387 
PvRTEN, M, 33 

Participation m the situation, in ob- 
servation, 37 

Paterson, D G, 61, 83, 94, 97, 509, 
522, 523 

Pathological states, freedom from, es- 
sential to observation, 27 
Patient, 493, 494 

psychopathic, 350 
Pearson, K , 53, 78 
Peg, Puzzle Test 307 
Pelotte, 404 
Percentages 138, 142 
Percentile scores 169 
Percentiles, turning ranks into, 91, 93 
Perception, 

essential to observation, 28, 29 
fallibility of, IS 
inaccuracy in, 29 
method, 392, 393 
Test, Ethical, 268-279 
Performance tests, 12, 113, 298-360, 

56s, 567, 569 

of introversion, 200 
Permanency of interest, 239-342 
Perry, C A , 546 
Perseveration, 379, 381 

volitional, 342, 343, 346 
Persing, K M, 150, 154 
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Persistence tests, 302, 321-325, 339, 
353. 562-56+, 566 
intct correlations of, 325 
reliability of, 325 
Personal 

answer, 291 
Attitudes Test, 74 
reliability of, 76 
Data Sheet, 174 
Inventory, 179 

Colgate, 158, 165, 180, 196, 
201, 202 
Personality, 437-440 
analysis of, 82 
and character tests, 266 
integration of, 377 
Inventor}', Bernreuter, 208 
raeasu'-ement of, 560, 561, 567, 
568 

Rating Scale, 72 
ratings, 519, 520 
Report, 70, 71 
Schedule, 183-185, 187 
Personnel spirit, 42, 43 
Perspiration, 401 
Peters, C C , 56 
Peterson, F , 376, 377, 428, 430 
Phenomena 
Fern’s, 421 
Tarchanoff’s, 421 
Phonograph, 393 
Photographs, 504, 521, 522 

estimating character from, 521, 
522 

Phrenology, 503-505 
Physical 

contact in the interview, 467 
examination, 567 

Physiognomic measurements, 518, 
520 

Physiognomy, 504 

Physiological measures of the emotions, 
see Emotions 
Physique, 504-514, 561 
measures of, 567 
Picture inhibition test, 563, 566 
Pituitary gland, 4+0 

factor in blood picssurc, 413 
Place, M J , 100, 160 
Planted Dime Test, 314 
PIcthysmograph, 415, 416 
Pneumograph, 417, 418 
Poffenbercer, a J , 130, 136, 350 
Polarixation, 424-427 
initial, 425 

of electrodes, factor in psychogal- 
vanic reflex, 427 
Porter, E , 266 
Portraits, character, 562, 566 
Positive affective tone, 377 


Posture, 13 

factor in breathing, 419 
variations of pulse with, 406 
Potassium salts, 405 
Potcntiomclci , 423 
Practice, 371, 374-376. 378, 475 

accuracy of observation, increased 
by, 27 

Prediction of school achievement, 244, 
38S 

Preference, occupational, question- 
naire of, 245, 246, 248 
Pregnancy, factor in blood pressure, 413 
Prejudice, 218-220 

freedom from essential in observa- 
tion, 29, 31 
measure of, l6l 
Preliminary letter, 130 
Preparation 

for the interview 458-460 
of question blank, 130-132 
of questions, 132-138 
Pressey, S L , 161, 164 165, 168, 189, 
iQt, 193. 195, 281, 478, 508 
Pressey X-0 Tests 161, 164, 165, 168, 
189-195, 508 
reliability of, 191, 192 
Pressure 

barometric, factor in breathing, 
419 

blood, see Blood 
diastolic, 410 

of electrodes, factor in psychogal- 
vanic reflex, 428 
systolic, 409, 410, 432-434 
Preview, in interviewing, 458 
Prcycr, W , 52’5 
Pride \tj, E , 320 
Principles 

as a control of conduct, 291, 292 
Test, 274, 275, 277, 285-288, 290 
w'cigh and choose, 277 
Privacy in the interview, 459 
Probability, 52 

integral, normal, Kcllcy-Wood table 
of, 68, 91, 551 

Probable error, weighting the, 148, 149 
Problem 

defining, in questionnaire studies, 
127, 128 

Record, Behavior, 68 
Problem children, 9 
diagnosis of, 17 
Problems, 

Arithmetic, Test, 311, 312 
behavior, 186 

Procedures in interviewing, 459-474 
Process interview, 479, 480 
Profile, 337, 342, 349 
facial, 504, 519, 520 
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Prognosis 

of crime, 8 

of impro\cmcnt, 9, 10 
of mental balance, 9 
Projection, 490 
Protestants, 265 
P'oterbs Test, 267 
Provocations Test, 272, 273, 276, 285- 
288, 290 

Psychastlienia, 180 
Ps) chasthcnoid, 180 
Psychiatric examination, 17 
Psychiatrist, 10, 17, 451, 456, 560, 
567 

Psychoanalysis, 6, 12, 14, 144, 145, 485- 
S02, 567 S68 

a technique of diagnosis 485 
a thcriipcutic measure, 485 
errois of interpretation in, 498 
Psychoanalyst, 138 
Psychoanalytic 

technique, 493-497 

evaluation of, 498-501 
theory, 485-493 
Psychogalvanic phenomena 

muscular theory of, 425, 426 
sweat theory of, 425, 426 
vasomotor theory of, 425, 426 
Psychogalvanic reflex, 379 
factors influencing, 426-429 
measurement of, 420-429 
Psychological questionnaire, 15, 16, 
139-171 

Psychologist, 17 
applied, 503 
Psychology 
fatuity, 22 
individual, 492 

Psychoneurotic Inventory, 146, 154, 
168, 169 171, 174-214, 567 
reliability of, 183 186 
validity of 186-188 
Woodworth, see Woodworth 
Psychopathic 
patients, 350 
states, 394 
Psychopathy, 386 
Psychosis, 512 
Pu, AST, 109 
Pulse, 379, 403-408, 415, 434 
amplitude, 415 
factors in breathing, 418 

rate, 401-407, 4 IS 
variations of, 405-407 
Pupil Data Sheet, 150 
Puzzle 

Fifteen, Test, 307 
Manipulation Test, 327 
Peg Test, 307 
Pyknic type, 509-512 


Qualitative analysis of free association, 
384 

Qualities, 

character, ii 

definition of, in rating, 84-86, 1 14 
to be rated, 

selecting, 61, 8z 
weighting, 82, 83 

Quantitative scores, yielded by ratings, 
45 

Queen, S A , 473 
Question 
blank 

arrangement of, 13 1 
design of, 130-132 
preparation of 130-132 
size of, 130, 13 1 
difficulty of, 470, 471 
direct, 142, 143 
incriminating, 137, 138 
leading, 135-137, 471 
negative, 137 
subtle, 142, 143 
suggestive, 137, 471 
yes-no, 140 

Questionnaire, 6, 13, 14, 16, 18, 19, 22, 
122 - 173 , 550 , 569 
accuracy of, 453, 454 
adjustment, 12, 174-214 
administration of, 128-130 
anonymous, 453 
appeal of, 130 

Ascendance-Submission, see As- 
cendance 

attitude, see Attitude 
checking in, 134, 135 
— compared with interview, 451-454 
consistency of, 149 
disguised, 14, 142, 143 
distribution, types of, yielded by, 
170 

duplicate, 128 

Garretson Interest, see Garret- 
son 

interest, see Interest 
introversion-extroversion, see In- 
troversion 

Occupational Preference, 245, 246, 
248 

on Cultural and Alechanical En- 
vironment, 553 
psychological, 15, 16, 139-171 
Psychoneurotic, see Psychoneu- 
rotic, Woodworth 
rating in a, 141 
reliability of, 161-169 
returns on, 129, 130 
scoring, methods of, 156-161 
selection of items for, 147-149 
sex differences in, 171 
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Questionnaire (continued) 

social attitudes, 137, 138, 166, 213, 
216 

studies, defining problem 111,127,128 
studiousncss, see Studiousness 
tabulation in, 135, 136 
to obtain facts. 123, 123-138 
to study individuals, 123 
trying out, 128 
used in original inquiry, 123 
validity of, 147-136 
Woodworth Psychoneurotic, see 
Woodworth 
Woodworth-Cady, 164 
Woodworth-Mathews, 164, 193 
Questions, 436 

in the interview, 469-472 
choice of, 471, 472 
number of, 132 
of environment, 146 
of reactions, 146 
preparation of, 132-138 
should be brief, 132, 133 
should be simple, 133 
should be specific, 133, 134 
should be unambiguous, 133 
should cover information desired, 
133 

Quotient, accomplishment, 333 


Racial 

attitudes, 223 

differences, 171, 187, 330-333 
Rademacuer, E S , 465, 474 
Radial artery, 403, 404, 409 
Radical, 234 

economic, 217-221 
Rank, 0, 486 

Ranking, 33, 76-78, 97, g8, 107 
combining incomplete, 92, 93 
offense, 279, 280 
statistical treatment of, 90-93 
Ranks, turning 

into percentiles, 91, 93 
into standard deviation units, 91, 
93 

Rannflls, M E , 464, 466 

Rapport, 461, 462 

Rate 

of breathing, measurement of, see 
Breathing 

of heart-beat, measurement of, 
403-407 

pulse, 401-407, 413 
Rater’s manual, 47, 48 
Raters, training, 47-30 
Rating, 41-J2I, 347, 348, 330, 430, 318, 
362, 363-567 

acquaintance factor m, loB, 109 


Rating (contmtied) 

administration of, 44-30 
advantages of, 41-44 
altruism, in, 110 

as an aid to administration, 41, 42 
average error in, 48-30 
blanks, 44, 45 
chart, 63, 64 
checking sheet in, 46 
conduct, 32 
confidence in, 103 
consistency of, 102, 103 
cross-on-a-lme method of, 62 
definition of qualities in, 84-86, 
114 

objective, 85 

differences in traits in reliability 
in, 104-108 

direction of error in, 48-30 
factois influencing reliability of, 
98-108 

generosity factor in, 96-98 
graphic, see Graphic 
improvement in, 114 
m a questionnaire, 141 
independent, 46 
judgment m, 31 
man-to-man, see Man-to-man 
methods, 6, 12, 13, 41-121, 369 
comparison of, 76-78 
note-taking, preliminary to, 46, 47 
number of scale divisions m, 78-80 
objectivity in, 76 
offense 279-284 
one indiv idiial at a time, 80, 81 
one trait at a time, 80, 81 
optimism in, no 
order-of-merit method, 33, 76 
paired comparison, 34, 53, 76 
peisonality, 319, 320 
preceded by observation, 46, 47, 
114 

purposes of, 41 
regression in, 104 
reliability of, 93-96 

effect of acquaintance on, 98, 
99 

factors influencing, 98-108 
mcreased by observation, 99, 
108 

scale. 

Army, 37-61, 63, in, 114 
Behavior, 69 
distribution m, 80 
graphic, see Graphic 
Items observable m, 84 
number of classes in, 78-80 
number of items in, 83 
of offenses, Clark, 279, 281 
Personality, 72 
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Rating (continued) 
schedule, 42 
self-, I09-III, 141, 234 
Self-Oi dinary-Ideal, 74-76 
sex difference in, 101, 102 
sheets 45, 46 

should be objective record, 45 
traits, 52 

used in a bank, 47 
validating test items by, 147 
validity of, 108-113 
variability in, 93 
Ratings 

as measure of character, 49 
as measuic of citizenship 49 
certain, 103 

character 49, 521 527, 528 
combining incomplete, 92, 93 
composite of, 114 
extreme 103, 114 
given periodically, 44 
given systematically, 44 
independent, 46, 95, 96 
make judgments analytical, 43 
make judgments representative, 43 
methods of making, 52-76 
provide data for research, 43, 44 
recording 54 

stimulate the person being rated, 
42 

systematic, 114 
yield quantitative scores, 45 
Ratio 

accomplishment, 333 
inspiration-expiration, 435 
Rationalization, 142, 219, 221 222, 489 
Raudenheimer, a S , 279, 281, 284, 
287 289, 302, 308, 310 
Ray, cathode, 424 
Reaction 

Study, Ascendance-Submission, see 
Ascendance 

tune, 373-378, 381, 384, 394. 430. 

431 

age differences in, 375 
average, 384 

average deviation of, 384 
of educated, 375, 376 
of intelligent, 375 
sex differences in, 375, 376 
to contradiction, 340, 341 
Reactionary, 234 
Reactions 

common, 370, 371, 385, 391 
description of, 12, 13 
doubtful, 386 

emotional, in the interview, 467, 
468 

group, 372, 374 
mdividual, 370, 371. 385, 391 


Reactions (continued) 
infantile 145 
measures of, 12 
questions of, 146 
Reading, 

character, 528 

Test, Thorndike-McCall, 235 
Reaxi, M J , 91, 242, 246 
Reasoning tests, 287 
Recognition tests, 274, 277, 285-288, 
290 

Record 

Behavior Problem, 68 
conduct, 563, 564 
cumulative, 46 
diaiy, 31 

objective, rating should yield, 45 
of observation, 28 
Recording 

m the interview, 479, 480 
observation, 38, 39 
ratmgs, 54 

time in free association, 364 
Rectangular distribution, 77. 91 
Reflex, psychogalvanic, see Psychogal- 
vanic 

Regression, 490 
m rating, 104 

Relay, Bean Test 313, 314 
Release, mental, 468 
Reliability, 5, 568 

diflercnces in rating traits, 104- 
108 

loss of, 79 

of Ascendance-Submission Reac- 
tion in Study, 207 
of attitude questionnaires, 231- 

of Burdick Apperception Test, 552 
of check list, 73 

of conduct knowledge and judg- 
ment tests, 284, 285 
of diagnosis, 20, 21, 22 
of Downey Will -Temperament 
Tests, 343, 344, 351 
of free association classification, 

372 

of graphic rating scale, 95 
of Guess Who Test 73, 74 
of interest questionnaires, 252, 253 
of interviewing, 475-479 
of introversion-extroversion ques- 
tionnaires, 201 
of judgment tests, 284, 285 
of man-to-man rating, 95 
of measures of deception, 316, 317 
of objective tests, 168 
of observation, 31, 33-36 
of persistence tests, 325 
of personal attitudes tests, 76 
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of Pressey X-0 Tests, 19 1, 192 
of Psychoneurotic Inventory, 185, 
186 

of questionnaire i6i-i6g 
of ratings, sec Rating 
of self-estimate, 99 
of Sims Scale of Socio-Economic 
Status, SS2 

of Studiousness Questionnaire, 227 
Religious 

and Economic Issues Watson Sur- 
vey of Public Opinion of, 216- 
222, 234, 23s 
education 

Drew Tests in, 268 
Indiana Survey of, 302 
tests, 222, 223 
Ideas, Chassell Test of, 267 
Remmehs, H H, too, 110 
Repetition of stimulus word, 378, 394 
Report, 391, 47S 

personality, 70, 71 
Repression, 488-490 
Reproduction 366, 377, 379, 391, 394 
Reputation, 562-564 
Research 

bibliographical, 126 
interview, 450, 451, 458 
ratings provide data for, 43, 44 
Resistance, 489, 490, 501 

body, factor in psychogalvanic re- 
flex, 427 
breaking, 468 

capillary, factor in blood pressure, 
411 

electrical, 423, 427 
in interview, 458 

meeting 464-468 
to opposition, 341 
to suggestion, 562-564 
Response 

classification of, in free association, 
368-371,388 
controlled, 299 
directed and undirected 299 
factors conditioning in free associa- 
tion, 371-373 
fluidity of, 339 
measuring, 536 
natural, 299 
speed of, 339 

Results of conduct, description of, 12, 

measures of, 12 
Retrospection, 144-146 
Returns on questionnaire, 129, 130 
Rheostat, 423 
Rice, S A, 216, 233 
Rich, G. J , 437-440 


595 

Right, deviation from accepted idea 
of, score, 74, 75 
Riklijj, Er , 361 
Ring, Chinese Test 324 
Rising action in interviewing, 460, 462- 
473 

Riva Rocci sphygmomanometer, 408-410 
Riv'ers, W H R, 491 
Rogers, C , 500 

Rosvnoie, a J, 361-363, 370-3721 374 f 
377, 381, 385-388 390 
Rosvnofe, I R, 372, 388 
Ruen, G M , 168, 186, 279, 289, 308, 
310, 346, 348, 351 

Rucc, H 0, 95, 96, 107, III, 1 14, 
123, 126, 128, 131, 135 

Rules 

for making graphic rating scale, 

65, 66 

Test, Five, 272 
Rome, B , 61 
Rusk, R R, 368, 375 


Safe Manipulation Test, 327 
Saliva, 437, 439 

factors in acidity of, 439 
SALSDERRy, P , 469, 470 
Sampling, 5, 474 
Saudek, R , 325, 526 
Scale, 19 

Army Rating, 57-61, 65, iii, 114 
Behavior Rating, 69 
Binct-Simon Intelligence, 15, 267, 
268, 351 

-divisions, number of in rating, 78- 
80 

for judging mtelligence, Pearson’s, 

53, 54 ^ , 

giaphic rating, see Graphic 
Group, for Investigating the Emo- 
tions, 189, 192 

Heidbredcr Inferiority Attitude 
Sclf-raling, 509 
medical interest, 248, 249 
men, 61 

number of items m a rating, 83 
of Attitude toward Movies, 229, 
230 

of measurement, traits as, 50-52 
of mental imagery, 53 
of Occupational Status, Barr, 542- 
545 

of Occupations, Taussig, 538 
Personality Rating, 72 
rating see Rating 
Sims, of Socio-Economic Status, 
see Sims 

Whittier, for Grading Home Con- 
ditions, 548, 549 
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Schedule 

case study, 15, 16, 556, SS7 
interview, 454, 461 
Peisonality, 183-185, 187 
rating, 42 

Schizoid temperament, 180, 510-512 

Schizophrenia, 180 

Schizophrenic, 509-512 

Scholarship, 12 

SciixLiDEMUiiL, G , 525 

School 

achievement, prediction of, 388 
effort in, measures of, 333-336 
honesty, 562-564 
marks, 79, 478 562-564, 566 
Textbooks Church, Score Card for 
Measuring, 56, 57 
Sciixv rcLFR, R A , 203-205 
SciiwEsiNccR, G C , 260, 261, 270, 
285, 289 

ScHWESiNGER, Ethical-Social Vocabu- 
lary Test, 270, 285, 286, 288 
Scientific method, 23 
Scientific sj'nthesis, 560 
Score, 

affcctivity, 161, 191, 192 
Card, 55. 56 . 83 

for Measuring Church School 
Textbooks, 56, 57 
for Mcasui mg the Home 547, 
548 

Sims, for Socio-Economic 
Status, see Sims 

criticism of axerage boy, 74, 75 
deviation from accepted idea of 
the right, 74, 75 
emotionality, 191-193 
fact, 306, 311 
feeling of difference, 74, 75 
idiosyncrasy, i6t, 191-193. 
inferiority, 74, 75 
of amount, 161 
percentile, 169 

quantitative, latings yield, 45 
self-criticism, 74, 75 
social insight, 74 75 
superiority and inferiority, 74, 75 
Scoring 
key 

determination of, 249, 250 
by consensus of judg- 
ment, 157, 158 
for free association, 390 
questionnaires, methods of, 156- 
161 

Scott, Sir Walter, 525 
Scott, W D , 57, 61 
Scout, Boy organization, 302 
Seating-plan, in classroom observation, 
39 


Secretion, glandular, 440 
Security of contact of electrodes, fac- 
tor in psjrhogalvanic reflex, 427 
Selection 

in observation, 30 
Item, see Item 

of qualities to be rated, 81, 82 
of teachers, 472 
Self 

-confidence, 341, 343, 346 
-criticism, scores, 74, 75 
-estimate, reliability of, 99 
-observation, 18 
-ordinary-ideal rating, 74-76 
-rating, 109-111, 141, 234 
-rating Scale, Heidbieder Inferior- 
ity Attitude, 509 

Sense 

of humor, 457 
organs, 24 
Sensitive paper, 403 
Sensory stimulation, factor in psycho- 
galvanic reflex, 429 
Sentence 

Completion Test, C E I, 311, 
312 

method, 393. 394 
tests, 271-277 
Seriousness of an act, 293 
Service, 12 

tests of. 325-327, 562-564 
mtercorrclations of, 326 
Sex, 486-489, 491-493, 496 

and handwriting, 526, 529 
differences 

m attitude, 235 
m complex signs, 380 
in conduct standards, 290 
in introversion-extroversion, 
204 

in moral standards, 290 
m questionnaires, 171 
in rating, loi, 102 
in reaction time, 375, 376 
in testimony, 475 
variations 

of blood pressure with, 41 1 
of pulse with, 405 
Sexual drives, 487, 488 
Shape of hand, 504, 524 
Sharp, F C , 268, 269 
Sheet, 

checking, in rating, 46 
personal data, 174 
Pupil Data, 150 
rating, 45, 46 
Sheldon, W H , 508, 509 
Shen, E, 94, 98, 99, 106, 109 
Shimberg, M E , 268, 289 
Shuttleworth, F K , 226, 561-566 
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SiDIB, B , 428 
Signs 

complex, 378-381 

of conduct, external, 12, 13, 503- 

Similarities Test, 270, 286, 288 

Simon, T , 267 

Simple constellation, 370 

Sims, V M , 539, 548, sso, SS2, 5 ^ 3 , 

564 

Sims Score Card for Socio-Economic 
Status, S39, 563, 564 
reliability of, 532 
Sincerity, 456 
Single inteiview, 474 
Situations, 181, 182 

control of, m observation, 36, 37 
controlled and uncontrolled, 299 
natural, 299, 300 

paiticipation in, in observation, 37 
social, observation of number of, 36 
observation of time spent m, 

s» 

factor in blood pressure, 41 
factor in breathing, 418 
of electrodes, factor m psychogal- 
vanic reflex, 427 
of home, 348 

of question blank, 130, 131 
variations of pulse with 405 
-weight illusion, 318, 319 
Skills, 560 

Slang, Vocabulary of, Schwesinger 
Test of, 261. 262 
Slavens, G S , 288 
Slawson, J , 99, 103, 104, 107 186 
Sleep, factor in breathing 419 
Small correlations, value of, 559 
Smith, W W , 375-377, 379, 43° 
Smoked paper, 402, 403 
Snedden, D S , 434 
Snow, A J , 99, 103 
Snyder, A , 290, 292, 293 
Sociability, 309 
Social 

attitudes 


and inteicst test, 224, 223 
Questionnaire, 137, 158, 166, 
213, 216 
Test of, 167 

contact, observation of, 36 
Ethical-, Vocabulary Test, Schwes- 
mger, 270, 283, 286, 288 


importance, basis for selecting 
Items in observation, 31 
information, 233 
insight score, 74, 73 


interest, 389, 390 
level of home, 536 


Social (conUiiiied) 

selection as basis of item selection, 

147, 148 

situations 

observation of numbei of, 36 

obsLivation of time spent in, 36 
worker, 431. 433. 437, 439, 462, 
464, 473, 479, 480 
Socialization, 224 
Socio-economic 

level, measures of, 348, 330-332, 
363 

level of the home, 336, 346 
Status, 

Sims Score Card for, see Suns 
Sommer, R , 420 
Sound associations, 368, 369 
Spaces Test, 307 
Spacing obsci vations, 33 
Speaking, factor 111 breathing, 419 
Sfearman, C E , 33, 93, 201, 233, 307, 
310, 313 323 

Spearman-Brown prophecy formula, 33, 
93, 201, 233, 307, 310, 313, 32s 
Specific trait, 107 
Speed 

of decision, 329-331, 34 °, 343 , 346, 
347 

Intel correlations of measures 
of, 330 

of movement, 339, 343-346 
of response, 339 
tests, 311, 312, 563 
Sphygmograph, 404, 407 
Sphygmomanometer, 408-410 
Spirit, personnel, 42, 43 
Spirometer, 313 
Spurious validity. 148. 149 
Square, magic numbci and word, 324 
Test, Magic, 314 
Squares, Nested, Test, 306, 363 
Stability 

emotional, 562-56.) 
mental, prognosis of 9 
Standard deviation units, turning ranks 
into, 91, 93 
Standardization, 568 

of the interview, 6, 454, 455 
vs versatility, 15, 16 
Standards, 169,286 
ethical, 290, 291 

occupational intelligence, 540, 541 
of conduct, 19, 20 
Standing broad jump, 313 
Stanford Revision of the Binet-Simon 
Intelligence Scale, 267, 268, 351 
Stand-outishness, 1 12 
Starch, D , 429 
Starling, E H , 406, 413 
Sta^, H E, 437 
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Statement, true-false, 140 
Statistical 

inquiry, 6, 18 
methods, 6, 7, 20 
treatment of paired comparisons, 
86-89, 93 

treatment of ranking, 90-93 
Stealing, 303 
tests of, 314 

Steps in interviewing, 459-474 
Stethoscope, 409 
Stigmata, 504 514-516 
Stimulation, sensorj' factor in psycho- 
galvanic reflet 429 
Stimulus word, repetition of, 378, 394 
Stop-watch, 365 
Story 391, 392 

Inhibition Test, 327, 328, 563 
Test, 277 278 
STR^rcc, R , 263 

Strong E K. , Jr , 148, 160, 242, 248- 
250. 253, 254 

Student Information Blank, 226 
Studiousncss 

Indc\, 334-336 
measures of, 333-336 
Questionnaire, 142 166, 226, 227 
reliability of, 227 

Study, 

case, see Case 

individuals, questionnaire to, 125 
of the individual, comprehensive, 

SSS-369 

Stupor, catatonic, 431 
Sturtevant, S M , 461 
Subjective 

associations, 370 
definition in rating, 85 
Sublimation, 491 

Submission, ascendance- see Ascend- 
ance 

Substitution Test, Digit-Symbol, 312 
Subtle Question, 142, 143 
Sugar 400 401 

Suggestibility, tests of, 318-321 
Suggestion 

in the interview 452 
resistance to, 562-564 
Suggestive question, 137, 471 
Summarv , diagnostic, in the interview, 
480 

Superiority score, 74, 75 
Supervision, parental, 548 
Supervisor 42 
Suppression, 489 
Survey 

Watson’s of Public Opinion on 
Some Religious and Economic 
Issues 142 166 2 t 6-222, 234, 23s 
Sutherland, J W,; 78 


Swallowing, variations of pulse with, 
406 

Sweat glands, 400, 421, 425-427 
Sweat theoiy of ps> chogalvanic phe- 
nomena, 425, 426 
Sweet L , 74, 76 
Symbolism in dreams, 496 

Symonds, P M , 30, 37-39. 73. 76. 79. 
98, 1 12, 1 13, 142, 154, 15s, 157, 
166, 171, 182, 185, 215, 226, 232 
23s. 251. 255. 334 
Sympathetic nerv'ous system, 400 
Synthesis, scientiflc, 560 
Systematic 

observation, 49 
ratings, 114 

S}Stolic pressure, 409, 410, 432-434 
Svz, H C , 431 

Tabulating machine, Hollerith, 131, 132 
Tabulation, 
forms, 131 

m questionnaire, 135, 136 
Tachistoscope, 392 
Tambour, 403, 418 
Tanaka, S , 290, 294 
Tannenbaum, S a , 499 
Tarciianoff, J , 420, 421 
Tarchanoff's phenomena, 421 
Talbestand, 391-393 
T.vussio, F W , 537-539 
Taussig Scale of Occupations, 538 
Teacher 

marks, 562, 564, 566 
selection, 472 
Technique, 

double testing, 308-312 
duplicating, 303-306 
improbable achievement, 306-308 
of diagrams, psychoanalysis as, 485 
of psychoanalysis, 493-497 
psychoanalytic, evaluation of, 498- 
501 

Techniques 

diagnostic, 6, 7, 15 
for lessening tension and meeting 
resistance, 464-468 

Temperament, 507-51Z, 514. 525. 529. 
561 

Dowmey Will-, Test, see Downey 
melancholic, 393 
Temperature 

factor m breathing, 419 
of electrodes, factor in psychogal- 
vanic reflex, 428 
variations of pulse with, 405 
Tension, emotional, 394 

lessening in an interview, 464-468 
Terman, L M, III, 179, 243, 302, 
310. 545 - - 
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Terman Group Test of Mental Ability, 
351 

Test, 13, 19, 22, 122 
Addition, 311, 312 
American Council Intelligence, 187, 
202, 508 

Applications, 275-277, 285-288, 

290 

Arguments, 219, 221 
Arithmetic Problems, 311, 312 
Army Alpha Intelligence, 539-541 
Bean Relay, 313, 314 
Bible Knowledge, Advanced, 266 
Bmet Intelligence, 15, 267, 268, 
3SI 

Binet Vocabulary, 329 
Books Read, 310 
Burdick Apperception, see Bur- 
dick 

Cancellation, 311, 312 
Cardboard, 305, 306 
Cause and Meet, 271 277, 285- 
288, 290 

C A V D Intelligence, 324 
C A V I Intelligence, 564 
Chassell, of Religious Ideas, 267 
Chinese Ring, 324 
Circles, 307 
Coin-Counting, 314 
Colgate Mental Hygiene, 164, 185, 
187, 203 

Completion, see Completion 
Comprehensions, 272-276, 285-288, 
290 

Coordination, 563, 566 
Cross, 324, 563 

Cross-out, 189-195, 217, 270, 281 
Degree of Truth, 217 
Digit-Symbol Substitution, 312 
Disguised, 14 
Dots, 312 

Duties, 271, 276 285-288, 290 
Envelopes, 326, 562, 563 
Ethical Discrimination, 260, 267 
Ethical Perception, 268-279 
Fair-mindedness, i6i, 216, 217 
Fifteen Puzzle, 307 
Five Rules, 272 
Foresights, 273, 274, 277 
Gates-Strang' of Health Knowl- 
edge, 263 

Generalizations, 217, 218 
Guess Who, see Guess Who 
Home, 313 
Inference, 220-222 
InformaVon, 311, 312, 562, 563 
Intelligence, see Intelligence 
Judgment, see Judgment 
Kinetic Will, 322, 323 
Kits, 326, 562, 563J 566 - 


Test (continued) 

Laycock Biblical Information, 266 
Magic Sqiiaie, 314 
Manipulation, 327 
Maze 306, 307 
Mental, see Intelligence 
Money Voting, 326, 563 
Moral Judgment, see Moral 
Moigan Group, Mental, 351 
Mysteiy Man, 314 
National Intelligence, 287 
Nested Squares, 306, 563 
new-type, 222 

Number Checking, 311, 312 
objective, see Objective 
of Ability to Choose Principles, 
277 

of Ability to Weigh Foreseen Con- 
sequences, 277, 284, 287, 289 
of delinquency, 390 
of Eje-Control, 331, 332 
of guilt, 382-385 
of lying, 382-385 
of Slang, 261, 262 
Opinion, see Opinion 
Opposites, 270, 285, 286, 288 
Orr Good Manners, 264, 270, 286, 
SSZ 

Overstatement, 308-310 
Paiaffin Paper, 303, 304 
Personal Attitudes see Personal 
Picture Inhibition, 563, 566 
Planted Dime, 314 
Picssey X-0 see Presscy 
Principles, 274, 275, 277, 285-288, 
290 

Prov'ocations, 272, 273, 276, 285- 
288, 290 
Puzzle Peg, 307 

Schwesinger, Ethical-Social Vo- 
cabulary, 270. 285, z86, 288 
Sentence Completion, CEI, 311, 
312 

Similarities, 270, 286, 288 
Social Attitudes, 167 
Social Attitudes and Interest, 224, 
225 

Spaces, 307 
Story, 277, 278 

Story Inhibition 327 328 563 
Terman Group, of Mental Ability, 
3SI 

Thorndike Intelligence for High 
School Graduates, 187, 255, 324, 
328, 508 

Thorndikc-McCall Reading, 235 
time, 272 

Trabue Completion, 329 
Voelker Completion, 303, 304 
^Weight Discrunination, 307, 308 
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Test (continued) 

Will-Toniprramcnt, see Downey 
Word Consequences, 270, 277, 286, 
288 

X- 0 , see Picsscy 
Festiinonv, 14. 17, 18, 471, 475 
accuracy of 47s 498 
age diflcrenccs in, 475 
error of intcrjuctation in, 475-477 
se\ differences in 475 
Testing 

ability 298 

double technique 308-312 
methods 6, 139-142 
Tests, 556 

achiei ement, 11 
aptitude, 567 

athletic pci forniance 313 562,563 
Biblical kiioulcdec 264-267 
C E 1 , 277, 284 286 
chaiactcr and personality, 266 
cooperation, 325-327 563 
difficulty 311 312 
Diiections Woodnorth-Wtlls 320 
Drew in Religious Education, 268 
fear distr.action 332 
Health Education, 270, 277 
helpfulness, 302 
honesty, 150, 302-318, 562-564 

I E R, 563 

Tnhibition ste Inhibition 
intelligence see Intelligence 
judgiiient Ste Judgment 
knowledge see Knott ledge 
Minnesota College Ability, 508 
moial ludemgnt and knowledge, 
sec Moial 

objective, see Objectit'C 
of cheating, 303-314 
of eomprchension of proverbs, fa- 
bles and parables, 267, 268 
of co]i) ing 303 

of intiotersion performance, 200 
of lying 315 
of opinion, 167, 562-567 
of stealing 314 

of Woid Knowledge, Thorndike, 
261 

paper and pencil, 12, 402 
performance, 12, 113, 298-360, 565 

567, 569 

persistence, see Persistence 
peisonality and character, 266 
Pressey X- 0 , see Presscy 
reasoning 287 

recognition 274, 277, 285-288, 290 
religious education, 22, 223 
sentence, 271-277 
service, see Service 
speed, 311, 312, 563 


Tests (continued) 

suggestibility, 318-321 
trubtw 01 thincss, 302 
vocabulary, 260-263, 287, 31I, 312 
word, 270 

association, 332 

Textbooks, Church School, Score Card 
for Measuring, 56, 57 
Theory 

muscular, of psychogalvanic phe- 
nomena, 425, 426 
psychoanalytic, 485-493 
sweat of psychogalvanic phenom- 
ena, 425 426 

vasoniotoi, of psychogalv'anic phe- 
nomena, 425, 426 
Therapy, 497 

psjchoanalytic, 485 
Thinking 223, 224 

Thomas, D S , 24, 26, 29, 31, 32, 34, 
35 

Thorndike, EL 86 87, 93, 11 1, 144, 
235, 239-243. 255. 261, 287, 302, 
324, 328, 364, 508 

Thorndike Intelligence Test for High 
School Graduates, 187 255, 324, 
328 508 

Tliorndikc-McCall Reading Test, 235 
Thorndike Tests of Word Knowledge, 
261 

Thorndike Word List 261, 364 
Thurstone, L L , 87 89, 91, 170, 
171, 183, 185, 187, 208, 229, 
230 236 

Thurstone T C , 183, 185, 187 208 
Thui stone Neuiotic Inventory, 208 
Ihyioid, factor in blood pressure, 413 
Time 

after eating, factor in acidity of 
uiine, 439 

interval in observation, 33 
reaction, see Reaction 
recording, in free association, 364 
spent in social situation, observa- 
tion of, 36 
test, 272 

Titration formal, method, 438 
Tolman, E C , 375 
Tone, affective, 377 
Tools, 553 

Toopb, H a , 129, 130 
Trabue Completion Test, 329 
Training 

in attitudes, 235 
of interviewer, 457 
raters, 47-50 
Traits, 21, 22 

as a scale of measurement, 50-52 
character, 518, 524 
desirable, ipi, 109, 110 
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Traits {conUnutd) 

differences m reliability of, in rat- 
ing, 104-108 
general, 107 
objective, 106, 107 
rating, 52 

ratings on character 49, 521, 527, 
528 

specific, 107 
undesirable, 109, no 
Transfer, 300, 301 
Transference, 490, 497 
Transformer, 423 

Traube-Hcring wa\cs factor in blood 
pressure, 412 
Treatment interview, 451 
Trow, W C, 109, 329-331 
True-false statement, 140 
Trustworthiness tests 302 
Truth, Degree of. Test, 217 
Truthfulness 
inde\, 3 IS 
of answers, 124 
Tube, Brown, 424 
Types, 367 

blond (e) -brunette, SO-f 522) 523 
convex and concave, of face, Si9) 
S20 

criminal, 314-516 
Kretschmer’s, S 04 ) 509-514 
of distributions yielded by ques- 
tionnaires, 170 
of evidence 12 
of insanity, 10 
of interviews, 450, 451 


UiiRBROCK R S , 339, 343, 344-346, 351 
Uncontrolled situations, 299 
Underrating, 109, no 
Undesirable traits, 109, no 
Undirected response, 299 
Uneasiness, 486-488 
Unit of measurement in observation, 33 
Unpleasant 

affective tone, 377 
associations, 375 
Urine, 437, 438 

acidity of, factors in, 439 
factors m creatinine concentration 
of, 439. 440 

Use of physiological measures of emo- 
tions, 430-437 
U-tube of mercury, 409 


Validating items by rating, 147 
Validity 

of ascendence-submission question- 
naire, 207 


Validity (continued) 

of attitude questionnaire, ZiZ-Z'iJ 
of averages, 152 153 
of interest qucslionnaiic, 253-256 
of introversion-extroversion ques- 
tionnaires, 202-203 
of judgment tests, 285-294 
of knowledge and judgment tests, 
285-294 

of psychoneuroticinventory, 186-188 
of questionnaire, 147-156 
of r.iting, 108-113 
spurious, 148, 149 
Value of small correlations, 559 
Van Dfndurc, J K , 548 
Variability 

in rating, 93 
of judgments, 102 
Variations of pulse, 405-407 
Vaso 

constrictor fibers, 414 
dilator fibers, 414 

Vasomotor theory of psychoagalvanic 
phenomena, 425, 426 
Veracit}', 149-156 
VcRACuiii, 0 , 420, 429, 430 
Verbal 

organuation, 566 
vs non-verbal diagnosis, 17, 18 
Verbs, 374 
Vcihor, 391 

Versatility vs standardization, 15, 16 
Vcrsuchsgcschichtc, 391, 392" 
ViGOURoux, R , 420 
Viola, G, 507 
Vocabulary, 471 
Test 

Bmet, 329 

Ethical-Social, Schwesinger, 
270, 285, 286, 288 
of Slang, 261, 262 
tests of, 260-263, 287, 311, 312 
Vocalion.il 

competence, diagnosis of, 8, 10 li 
counselors, 11 
guidance, 516 

Interest Blank, 169, 250, 253 
VoELKCR, P F, 302-304, 306, 308 
Voelker Completion Test, 303, 304 
Volitional perseveration, 342, 343, 346 
Volume 

blood, see Blood 

Voluntary diagnosis, 14, 15, 568 
interviewing 457-438 
Voting, Money, Test, 326, 563 


Waite, H, 94 
Waller, A D , 426 
War, World, see World 
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Washburn, M T , 383 
Wasiiburne, J N , 207, 208 
Water intak.c, factor m acidity of 
urine, 439 

Watson, G B , 74, 139, 140, ii2, 160, 
161, 16C 181 182, 2i6, 219, 222, 
223, 231, 232 234-236, 266, 268, 
269, 284 

W.itson Survey of Public Opinion on 
Some Religious and Economic 
Issues, 142, 166, 216-222, 234, 
735 

FBB E 46, 94, 100 161 
\\iBrR C 0 , 193, 289 
WsCHSLhR, D , 423 424, 427, 428 
U HIRLIN K , 388 

Weight, 506, 507 

Disci imination Test 307, 308 
factor in acidity of urine, 439 
Weighting 

Items 159-161, 248 249 

McCall-Long method of, 160 
qualities to be lated 82 83 
the probable error, 148, 149 
Wellu in B , 204 

W'ells, F L si, ioo, 102, 103, III, 
320, 365, 3 ( 37 . 371 373 , 374 , 
377 378, 388 
Welis II M 426 428 
VVertheimcr, F I, SI I, 512 
WiRTIIEIMER, M , 391 392, 393 
W'’heatstonc Bridge 422, 423 
Whipele, G M 318, 475 
Whitela, P C, 164, 165 191 
WiiiTLEr, M T , 265, 266 
W'hittier Scale for Grading Home Con- 
ditions, 548, 549 
WicKMte, E K 68 
Willy, L E, 434, 435 
Will. 

factor IB breathing, 419 
factor in psychogalvanic reflex, 429 
Kinetic, Test, 322, 323 
Temperament Test, see Downey 
Willett, G W , 239, 240 
Williams, J II , 548 
Wires, E, 350 
Wish, 207 208 
Wohlgemuth, A , 144, 145 
Wood, B. D, 88, 91, 551 


Woodrow, H , 308 310, 372 
Woodworth, R S, 146, 154, 164, 168, 
169, 171, 174, 176-181, 183, 185- 
>89, 193, 196, 320, 373, 401, 508 
W^oodworth Psychoneurotic Inventory, 
164 174-181, 185-189, 19s, 196, 
208, 401, 508 

Woodworth-Cad}' Questionnaire, 164 
Woodworth-House Mental Hygiene In- 
ventory, 164, 180 

W' ood worth-M athews Questionna irc, 
164, 193 

W^oodworth-Wells Directions Tests, 320 
Word 

association tests, 332 
Consequences Test, 270, 277, 286, 
288 

Knowledge, Thoindike Tests of, 261 
List, Thorndike, 261, 364 
magic, squares, 324 
stimulus, repetition of, 378, 394 
tests, 270 
Words 

abstract, 374 
concrete, 374 
Worker 

case, 555, 560 
clinical, 6, 7 

social, 451, 4 SS, 457 , 459 , 462, 464. 
473 , 479 , 480 

Woild War, 114, 174, 187 337 , 433, 
488, 489, 539 , 56s, 566 
Wresciiner, a, 388 
Wundt, W , 394 
Wylie, A T, 151-153 
Wyman, J B , 389 


X -0 Tests, Pressey, see Pressey 


Yale Psycho-cIinic, 37 
Yes-no question, 140 
Yoakum, C S , 104, no, 245 


Zachry, C B , 499 
Zelehy, L D , 167, 232 
Ziehen, Th , 375 
Zones, erogeneous, 487 




